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Abstract 

We consider the hard-core model with Metropolis transition probabilities on finite grid graphs and investigate 
the asymptotic behavior of the first hitting time between its two maximum-occupancy configurations in the low- 
temperature regime. In particular, we show how the order-of-magnitude of this first hitting time depends on the 
grid sizes and on the boundary conditions by means of a novel combinatorial method. Our analysis also proves the 
asymptotic exponentiality of the scaled hitting time and yields the mixing time of the process in the low-temperature 
limit as side-result. In order to derive these results, we extended the model-independent framework in (27l for first 
hitting times to allow for a more general initial state and target subset. 
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1 Introduction 

Hard-core lattice gas model. In this paper we consider a stochastic model where particles in a finite volume dynam¬ 
ically interact subject to hard-core constraints and study the first hitting times between admissible configurations 
of this model. This model was introduced in the chemistry and physics literature under the name "hard-core lattice 
gas model" to describe the behavior of a gas whose particles have non-negligible radii and cannot overlap | [21[l36| . 
We describe the spatial structure in terms of a finite undirected graph A of N vertices, which represents all the 
possible sites where particles can reside. The hard-core constraints are represented by edges connecting the pairs 
of sites that cannot be occupied simultaneously. We say that a particle configuration on A is admissible if it does not 
violate the hard-core constraints, i.e. if it corresponds to an independent set of the graph A. The appearance and 
disappearance of particles on A is modeled by means of a single-site update Markov chain {with Metropo¬ 
lis transition probabilities, parametrized by the fugacity A > 1. At every step a site v of A is selected uniformly 
at random; if it is occupied, the particle is removed with probability 1 / A; if instead the selected site v is vacant, 
then a particle is created with probability 1 if and only if all the neighboring sites at edge-distance one from v are 
also vacant. Denote by X(A) the collection of independent sets of A. The Markov chain {A } f cN is ergodic and 
reversible with respect to the hard-core measure with fugacity A on 1(A), which is defined as 

" A(f) := MK)' ' e I(A) - (1) 

where Z^(A) is the appropriate normalizing constant (also called partition function). The fugacity A is related to the 
inverse temperature f of the gas by the logarithmic relationship log A — p. 

We focus on the study of the hard-core model in the low-temperature regime where A —» oo (or equivalently p —> 
oo), so that the hard-core measure y\ favors maximum-occupancy configurations. In particular, we are interested 
in how long it takes the Markov chain to "switch" between these maximum-occupancy configurations. 

Given a target subset of admissible configurations A C 1(A) and an initial configuration x <f_ A, this work mainly 
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focuses on the study of the first hitting time of the subset A for the Markov chain {Xf} fg jvj with initial state x at 
time t — 0. 

Two more application areas. The hard-core lattice gas model is thus a canonical model of a gas whose particles have 
a non-negligible size, and the asymptotic hitting times studied in this paper provide insight into the rigid behavior 
at low temperatures. Apart from applications in statistical physics, our study of the hitting times is of interest for 
other areas as well. The hard-core model is also intensively studied in the area of operations research in the context 
of communication networks l23l . In that case, the graph A represents a communication network where calls arrive 
at the vertices according to independent Poisson streams. The durations of the calls are assumed to be independent 
and exponentially distributed. If upon arrival of a call at a vertex i, this vertex and all its neighbors are idle, the call 
is activated and vertex i will be busy for the duration of the call. If instead upon arrival of the call, vertex i or at least 
one of its neighbors is busy, the call is lost, hence rendering hard-core interaction. In recent years, extensions of this 
communication network model received widespread attention, because of the emergence of wireless networks. A 
pivotal algorithm termed CSMA l37l which is implemented for distributed resource sharing in wireless networks 
can be described in terms of a continuous-time version of the Markov chain studied in this paper. Wireless devices 
form a topology and the hard-core constraints represent the conflicts between simultaneous transmissions due to 
interference f37]. In this context A is therefore called interference graph or conflict graph. The transmission of a data 
packet is attempted independently by every device after a random back-off time with exponential rate A, and, if 
successful, lasts for an exponentially distributed time with mean 1. Hence, the regime A —> oo describes the scenario 
where the competition for access to the medium becomes fiercer. The asymptotic behavior of the first hitting times 
between maximum-occupancy configurations provides fundamental insights into the average packet transmission 
delay and the temporal starvation which may affect some devices of the network, see [39]. 

A third area in which our results find application is discrete mathematics, and in particular for algorithms de¬ 
signed to find independent sets in graphs. The Markov chain {Xf can be regarded as a Monte Carlo algorithm 
to approximate the partition function Z^(A) or to sample efficiently according to the hard-core measure p,\ for A 
large. A crucial quantity to study is then the mixing time of such Markov chains, which quantifies how long it takes 
the empirical distribution of the process to get close to the stationary distribution p\. Several papers have already 
investigated the mixing time of the hard-core model with Glauber dynamics on various graphs (3j [T9] [20, 34] . By 
understanding the asymptotic behavior of the hitting times between maximum-occupancy configurations on A as 
A —> oo, we can derive results for the mixing time of the Metropolis hard-core dynamics on A, which in general is 
smaller than for the usual Glauber dynamics, as illustrated in Il25l . 

Results for general graphs. The Metropolis dynamics in which we are interested for the hard-core model can be 
put, after the identification eP — A, in the framework of reversible Freidlin-Wentzel Markov chains with Metropolis 
transition probabilities (see Section [2] for precise definitions). Hitting times for Freidlin-Wentzel Markov chains are 
central in the mathematical study of metastability. In the literature, several different approaches have been intro¬ 
duced to study the time it takes for a particle system to reach a stable state starting from a metastable configuration. 
Two approaches have been independently developed based on large deviations techniques: the pathwise approach, 
first introduced in [6] and then developed in 1311 [32] . 331 , and the approach in mmmm. Other approaches to 
metastability are the potential theoretic approach |4j |5) and, more recently introduced, the martingale approach (l] [2], 
see m for a more detailed review. 

In the present paper, we follow the pathwise approach, which has already been used to study many finite- 
volume models in a low-temperature regime, see IITTIT2llT5llT6llT7ll24l!29lf30| . where the state space is seen as an 
energy landscape and the paths which the Markov chain will most likely follow are those with a minimum energy 
barrier. In I l3ll [32[ [33] the authors derive general results for first hitting times for the transition from metastable 
to stable states, the critical configurations (or bottlenecks) visited during this transition and the tube of typical 
paths. In [27] the results on hitting times are obtained with minimal model-dependent knowledge, i.e. find all 
the metastable states and the minimal energy barrier which separates them from the stable states. We extend the 
existing framework l27l in order to obtain asymptotic results for the hitting time r[ for any starting state x, not 
necessarily metastable, and any target subset A, not necessarily the set of stable configurations. In particular, we 
identify the two crucial exponents T_ (x. A) and T_|_ (x. A) that appear in the upper and lower bounds in probability 
for rf in the low-temperature regime. These two exponents might be hard to derive for a given model and, in 
general, they are not equal. However, we derive a sufficient condition that guarantees that they coincide and also 
yields the order-of-magnitude of the first moment of rf on a logarithmic scale. Furthermore, we give another 
slightly stronger condition under which the hitting time rf normalized by its mean converges in distribution to an 
exponential random variable. 

Results for rectangular grid graphs. We apply these model-independent results to the hard-core model on rect- 
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angular grid graphs to understand the asymptotic behavior of the hitting time t„, where e and o are the two 
configurations with maximum occupancy, where the particles are arranged in a checkerboard fashion on even and 
odd sites. Using a novel powerful combinatorial method, we identify the minimum energy barrier between e and 
o and prove absence of deep cycles for this model, which allows us to decouple the asymptotics for the hitting time 
Tg and the study of the critical configurations. In this way, we then obtain sharp bounds in probability for rjj, since 
the two exponents coincide, and find the order-of-magnitude of Er® on a logarithmic scale, which depends both 
on the grid dimensions and on the chosen boundary conditions. In addition, our analysis of the energy landscape 
shows that the scaled hitting time / E is exponentially distributed in the low-temperature regime and yields 
the order-of-magnitude of the mixing time of the Markov chain {Xf 

By way of contrast, we also briefly look at the hard-core model on complete K-partite graphs, which was already 
studied in continuous time in [38). While less relevant from a physical standpoint, the corresponding energy land¬ 
scape is simpler than that for grid graphs and allows for explicit calculations for the hitting times between any pair 
of configurations. In particular, we show that whenever our two conditions are not satisfied, T_(x, A) f T+ (x, A) 
and the scaled hitting time is not necessarily exponentially distributed. 


2 Overview and main results 

In this section we introduce the general framework of Metropolis Markov chains and show how the dynamical 
hard-core model fits in it. We then present our two main results for the hitting time r® for the hard-core model on 
grid graphs and outline our proof method. 


2.1 Metropolis Markov chains 


Let X be a finite state space and let H : X —» R be the Hamiltonian, i.e. a non-constant energy function. We con¬ 
sider the family of Markov chains {Xj }feN on X with Metropolis transition probabilities T’p indexed by a positive 
parameter ft 




q(x,y)e H ( x )l + / if xf^y, 

1 - Ez^x p fi( x >z), it x — y, 


( 2 ) 


where q : X x X —>• [0,1] is a matrix that does not depend on ft. The matrix q is the connectivity function and we 
assume it to be 


• Stochastic, i.e. Ei/e* <]{x,y) — 1 for every x £ X; 

• Symmetric, i.e. q{x,y) = q{y,x) for every x,y £ X; 

• Irreducible, i.e. for any x, y £ X, x 7 ^ y, there exists a finite sequence co of states oq,..., co„ £ X such that 
lo\ — x, co n — y and q(cOj, oy + 1 ) > 0, for i — 1,..., n — 1. We will refer to such a sequence as a path from x to y 
and we will denote it by co : x —> y. 

We call the triplet (X, H, q) an energy landscape. The Markov chain {xf } f€ ]N is reversible with respect to the Gibbs 
measure 

e -m*) 

H{X) '' = <3) 

Furthermore, it is well-known (see for example |9} Proposition 1.1]) that the Markov chain {xf }fgN is aperiodic 
and irreducible on X. Hence {X^} f€ jvj is ergodic on X with stationary distribution pp. 

For a nonempty subset A C X and a state x £ X, we denote by rft th e first hitting time of the subset A for the 

Markov chain {Xj’} fe i^ with initial state x at time t — 0, i.e. 


r x A := inf{f > 0 : xf £ A | X^ = x}. 

Denote by X s the set of stable states of the energy landscape (X, H, q), that is the set of global minima of H on X, 
and by X m the set of metastable states, which are the local minima of H in X \ X s with maximum stability level (see 
Section[3]for definition). The first hitting time rf is often called tunneling time when x is a stable state and the target 
set is some A C X s \ {x}, or transition time from metastable to stable when x £ X m and A = X s . 
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2.2 The hard-core model 


The hard-core model on a finite undirected graph A of N vertices evolving according to the dynamics described in 
Section[l]can be put in the framework of Metropolis Markov chains. Indeed, we associate a variable u(v) £ {0,1} 
with each site v £ A, indicating the absence (0) or the presence (1) of a particle in that site. Then the hard-core 
dynamics correspond to the Metropolis Markov chain determined by the energy landscape (X, H, q) where 

• The state space X C {0,1} A is the set of admissible configurations on A, i.e. the configurations cr £ (0,1} A such 
that cr(v)cr(w) — 0 for every pair of neighboring sites v, iv in A; 

• The energy of a configuration cr £ df is H(cr) := ~LveA^)i 

• The connectivity function q allows only for single-site updates (possibly void), i.e. for any a, cr' £ X, 


q(a,a') 


[ JI' if|{z?eA 

< 0, if |{p £ A 

90,'7), if <T = o i - 


a(v) + = 1 , 

a(v) ^ a'{v )}| > 1 , 


For A = eP the hard-core measure m on A is precisely the Gibbs measure © associated with the energy landscape 
(X,H,q). 

Our main focus in the present paper concerns the dynamics of the hard-core model on finite two-dimensional 
rectangular lattices, to which we will simply refer to as grid graphs. More precisely, given two integers K,L> 2, 
we will take A to be a K x L grid graph with three possible boundary conditions: Toroidal (periodic), cylindrical 
(semiperiodic) and open. We denote them respectively by Tk,l, Ck,l and G^j. Figure [I] shows an example of the 
three possible types of boundary conditions. 





Figure 1: Examples of grid graphs with different boundary conditions 


There are in total N — KL sites in A. Every site v £ A is described by its coordinates (zq, vf), and since A is 
finite, we assume without loss of generality that the leftmost (respectively bottommost) site of A has the horizontal 
(respectively vertical) coordinate equal to zero. A site is called even {odd) if the sum of its two coordinates is even 
(odd, respectively) and we denote by V e and V 0 the collection of even sites and that of odd sites of A, respectively. 

The open grid Gk,l is naturally a bipartite graph: All the neighbors in A of an even site are odd sites and 
vice versa. In contrast, the cylindrical and toroidal grids may not be bipartite, so that we further assume that K 
is an even integer for the cylindrical grid C^,l and that both K and L are even integers for the toroidal grid Tk,l- 
Since the bipartite structure is crucial for our methodology, we will tacitly work under these assumptions for the 
cylindrical and toroidal grids in the rest of the paper. As a consequence, Tk,l and Ck,l are balanced bipartite graphs, 
i.e. | Ve| = |V 0 |. The open grid Gk,l has \ V e \ — {KL/ 2] even sites and |V 0 | = [KL/2 J odd sites, hence it is a balanced 
bipartite graphs if and only if the product KL is even. We denote by e (o respectively) the configuration with a 
particle at each site in V e (V n respectively). More precisely. 


e(v) 


1 if v £ V e , 
0 if v £ V 0 , 


and o(v) 


0 if v £ V e , 
1 ifveVg. 
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Note that e and o are admissible configurations for any choice of boundary conditions, and that H(e) = — | V e — 
— [XL/2] and H(o) — — | V^, | = — [XL/2J. In the special case where A = Gk,l with KL = 1 (mod 2), H(e) < H(o) 
and, as we will show in Section [51 X s — {e} and X m — {o}. In all the other cases, we have H(e) — H( o) and 
X s — {e, o}; see Section|5]for details. 

2.3 Main results and proof outline 

Our first main result describes the asymptotic behavior of the tunneling time for any rectangular grid A in the low- 
temperature regime ft —> oo. In particular, we prove the existence and find the value of an exponent T(A) > 0 that 
gives an asymptotic control in probability of r* on a logarithmic scale as ft —> oo and characterizes the asymptotic 
order-of-magnitude of the mean tunneling time IEt®. We further show that the tunneling time normalized by its 
mean converges in distribution to an exponential unit mean random variable. 

Theorem 2.1 (Asymptotic behavior of the tunneling time t£). Consider the Metropolis Markov chain {xf }t e M corre¬ 
sponding to hard-core dynamics on a K x L grid A as described in Subsection 12.21 There exists a constant T(A) > 0 such 
that 

(i) For every e > 0, lim^,*, ^( r ( A ) -£ ) < < e^( r ( A )+ £ )^ = 1 , 

(ii) lim i logEr® = T(A), 

p—>oo p 

(iii) Exp(l), as/3 —>■ oo. 

In the special case where A — Gk,l with KL = 1 (mod 2), (i), (ii), and (iii) hold also for the first hitting time r°, but 
replacing T (A) by T(A) — 1. 

Theorem l2.1l relies on the analysis of the hard-core energy landscape for grid graphs and novel results for hitting 
times in the general Metropolis Markov chains context. We first explain these new model-independent results and, 
afterwards, we give details about the properties we proved for the energy landscape of the hard-core model. 

The framework lf27l focuses on the most classical metastability problem, which is the characterization of the 
transition time t'P between a metastable state t] £ X m and the set of stable states X s . However, the starting 
configuration for the hitting times we are interested in, is not always a metastable state and the target set is not 
always X s . In fact, the classical results can be applied for the hard-core model on grids for the hitting time r° only 
in the case of an K x L grid with open boundary conditions and odd side lengths, i.e. KL = 1 (mod 2). Many other 
interesting hitting times are not covered by the literature, including: 

• The hitting time when A is a K x L grid with open boundary conditions and odd side lengths, i.e. KL = 1 
(mod 2), which is a transition from the unique stable state e to the metastable state o; 

• The hitting times = t° when A is an K x L grid with KL = 0 (mod 2) for any boundary conditions, since 
the configurations e and o are both stable states; 

• The hitting time between any pair of local minima when A is a complete X-partite graph. 

We therefore generalize the classical pathwise approach Il27l to study the first hitting time rf for a Metropolis 
Markov chain for any pair of starting state x and target subset A. The interest of extending these results to the tunneling 
time between two stable states was already mentioned in [27 33j, but our framework is even more general and we 
could study rf for any pair (x, A), e.g. the transition between a stable state and a metastable one. 

Our analysis relies on the classical notion of cycle, which is a maximal connected subset of states lying below 
a given energy level. The exit time from a cycle in the low-temperature regime is well-known in the literature |9, 
llOl [T3] m l33l and is characterized by the depth of the cycle, which is the minimum energy barrier that separates 
the bottom of the cycle from its external boundary. The usual strategy presented in the literature to study the first 
hitting time from x £ X m to A — X s is to look at the decomposition into maximal cycles of the relevant part of 
the energy landscape, i.e. X \ X s . The first model-dependent property one has to prove is that the starting state 
x is metastable, which guarantees that there are no cycles in X \ X s deeper than the maximal cycle containing the 
starting state x, denoted by C^(x). In this scenario, the time spent in maximal cycles different from C/\ (x), and 
hence the time it takes to reach X s from the boundary of C/ (x), is comparable to or negligible with respect to the 
exit time from C/> (x), making the exit time from C/| (x) and the first hitting time rf of the same order. 
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In contrast, for a general starting state x and target subset A all the maximal cycles of X \ A can potentially have 
a non-negligible impact on the transition from x to A in the low-temperature regime. By analyzing these maximal 
cycles and the possible cycle-paths, we can establish bounds in probability of the hitting time r\ on a logarithmic 
scale, i.e. obtain a pair of exponents T_ (x. A), T +(x, A) such that for every e > 0 

Urn P /3 (e^ r -(*' A )“ e ) < rf < e /Kr+(*,A)+ £ )^ = 1 

The sharpness of the exponents T_ (x. A) and T + (x. A) crucially depends on how precisely one can determine which 
maximal cycles are likely to be visited and which ones are not, see Section[3]for further details. Furthermore, we give 
a sufficient condition (see Assumption A in Section O, which is the absence of deep typical cycles, which guarantees 
that r_(x,A) — T = r+(x,A), proving that the random variables j- log converge in probability to T as f —> oo, 
and that lim^oo j, log Erjj = F. In many cases of interest, one could show that Assumption A holds for the pair 
(x,A) without detailed knowledge of the typical paths from x to A. Indeed, by proving that the model exhibits 
absence of deep cycles (see Proposition 13 .1 8t . similarly to 1271 , also in our framework the study of the hitting time 
rf is decoupled from an exact control of the typical paths from x to A. More precisely, one can obtain asymptotic 
results for the hitting time rf in probability, in expectation and in distribution without the detailed knowledge of 
the critical configuration or of the tube of typical paths. Proving the absence of deep cycles when x G X m and 
A — X s corresponds precisely to identifying the set of metastable states X m , while, when x G X s and A = X s \ {x}, 
it is enough to show that the energy barrier that separates any state from a state with lower energy is not bigger 
than the energy barrier separating any two stable states. 

Moreover, we give another sufficient condition (see Assumption B in Section [3]), called "worst initial state" as¬ 
sumption, to show that the hitting time rf normalized by its mean converges in distribution to an exponential unit 
mean random variable. However, checking Assumption B for a specific model can be very involved, and hence we 
provide a stronger condition (see Proposition 13.20b . which includes the case of the tunneling time between stable 
states and the classical transition time from a metastable to a stable state. The hard-core model on complete K-partite 
graphs is used as an example to illustrate scenarios where Assumption A or B is violated, T_ (x. A) f F | (x, A ) and 
the asymptotic result for Er’j of the first moment and the asymptotic exponentiality of rf/Erf do not hold. 

In the case of the hard-core model on a rectangular grid A, we develop a powerful combinatorial approach which 
shows the absence of deep cycles (Assumption A) for this model, concluding the proof of Theorem l2.ll Furthermore, 
it yields the value of the energy barrier T (A) between e and o, which turns out to depend both on the grid size and 
on the chosen boundary conditions. This is illustrated by the next theorem, which is our second main result. 

Theorem 2.2 (The exponent F(A) for rectangular grids). Let A a K x L rectangular grid. Then the energy barrier T(A) 
between e and o appearing in Theorem [2T71 takes the values 

( min{K, L} + 1 if A — T K/L , 

r(A)= lmm{\K/2],\L/2]}+l if A — G KL , 

[min{K/2,L} + l ifA = C K , L . 

The crucial idea behind the proof of Theorem 12. 2l is that along the transition from e to o, there must be a critical 
configuration where for the first time an entire row or an entire column coincides with the target configuration o. 
In such a critical configuration particles reside both in even and odd sites and, due to the hard-core constraints, an 
interface of empty sites should separate particles with different parities. By quantifying the "inefficiency" of this 
critical configuration we get the minimum energy barrier that has to be overcome for the transition from e to o to 
occur. The proof is then concluded by exhibiting a path that achieves this minimum energy and by exploiting the 
absence of other deep cycles in the energy landscape. 

Lastly, we show that by understanding the global structure of an energy landscape (X, H, q) and the maximum 
depths of its cycles, we can also derive results for the mixing time of the corresponding Metropolis Markov chains 
as illustrated in Subsection 13.81 In particular, our results show that in the special case of an energy land¬ 
scape with multiple stable states and without other deep cycles, the hitting time between any two stable states and 
the mixing time of the chain are of the same order-of-magnitude in the low-temperature regime. This is the case 
also for the Metropolis hard-core dynamics on grids, see Theorem l5.4l in Section[5] 

The rest of the paper is structured as follows. Section [3] is devoted to the model-independent results valid for a 
general Metropolis Markov chain, which extend the classical framework |27) . The proofs of these results are rather 
technical and therefore deferred to Section 0] In Section 0 we develop our combinatorial approach to analyze the 
energy landscapes corresponding to the hard-core model on grids. We finally present in Section[ 6 ]our conclusions 
and indicate future research directions. 
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3 Asymptotic behavior of hitting times for Metropolis Markov chains 

In this section we present model-independent results valid for any Markov chains with Metropolis transition prob¬ 
abilities 0 defined in Subsection l2.ll In Subsection l3.1 I we introduce the classical notion of a cycle. If the considered 
model allows only for a very rough energy landscape analysis, well-known results for cycles are shown to readily 
yield upper and lower bounds in probability for the hitting time rf: indeed, one can use the depth of the initial cycle 
Ca(x) as T fx, A ) (see Propositions 13.4b and the maximum depth of a cycle in the partition of X \ A as T + (x, A) 
(see Proposition l3.7l l. If one has a good handle on the model-specific optimal paths from x to A, i.e. those paths along 
which the maximum energy is precisely the min-max energy barrier between x and A, sharper exponents can be 
obtained, as illustrated in Proposition 13.101 by focusing on the relevant cycle , where the process started in 

x spends most of its time before hitting the subset A. We even further sharpen these bounds in probability for the 
hitting time t x a with Proposition l3.15l by studying the tube of typical paths from x to A or standard cascade, a task that 
in general requires a very detailed but local analysis of the energy landscape. To complete the study of the hitting 
time in the regime f> —> oo, we prove in Subsection 13.51 the convergence of the first moment of the hitting time rjj 
on a logarithmic scale under suitable assumptions (see Theorem 13.171 and give in Subsection 13.61 sufficient condi¬ 
tions for the scaled hitting time xf/TErf to converge in distribution as f -X- co to an exponential unit mean random 
variable, see Theorem 13. 191 Furthermore, we illustrate in detail two special cases which fall within our framework, 
namely the classical transition from a metastable state to a stable state and the tunneling between two stable states, 
which is the relevant one for the model considered in this paper. In Subsection l3.7l we briefly present the hard-core 
model on a complete k-partite graph, which is an example of a model where the asymptotic exponentiality of the 
scaled hitting times does not always hold. Lastly, in Subsection l3.8l we present some results for the mixing time and 
the spectral gap of Metropolis Markov chains and show how they are linked with the critical depths of the energy 
landscape. 

In the rest of this section and in Section [TJ {Xf } fP [\j will denote a general Metropolis Markov chain with energy 
landscape (A, H, q) and inverse temperature f, as defined in Subsection 12.II 

3.1 Cycles: Definitions and classical results 

We recall here the definition of cycle and present some well-known properties. 

Recall that a path co : x —> y has been defined in Subsection 12 .1 1 as a finite sequence of states <x >\,..., co n E X 
such that oq = x, co n — y and q(cOi, C 0 j + 1 ) > 0, for i = 1,..., n — 1. Given a path co — (oq,..., co n ) in X , we denote 
by |<u| n its length and define its height or elevation by 

Oo, := max H(cOj). (4) 

A subset A C X with at least two elements is connected if for all x, y £ A there exists a path co : x —> y, such that 
C 0 j E A for every i — 1,... ,\co\. Given a nonempty subset A C X and x f A, we define CI X/ a as the collection of all 
paths co : x —> y for some y £ A that do not visit A before hitting y, i.e. 

Cl X 'A {co : x ^ y \ y E A, coi yL AM i < |o;|}. (5) 

We remark that only the endpoint of each path in Ci x ^ belongs to A. The communication energy between a pair 
x, 1/ £ X is the minimum value that has to be reached by the energy in every path co : x —? y, i.e. 

A>(x,y) := min (6) 

(O'.x—>y 

Given two nonempty disjoint subsets A,BcX, we define the communication energy between A and B by 

<J>(A,B):= min <P(x,y). (7) 

xe A,yeB 

Given a nonempty set A C X, we define its external boundary by 

dA {y i A | 3x E A : q(x,y) > 0}. 

For a nonempty set A C X we define its bottom T(A) as the set of all minima of the energy function H(-) on A, i.e. 

X(A) := (y E A : H(y) = min H(x)}. 

xEA 
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Let X s T(X ) be the set of stable states, i.e. the set of states with minimum energy. Since X is finite, the set X s is 

always nonempty Define the stability level V x of a state x £ X by 


V x :=<S>{x,l x )-H(x), 


( 8 ) 


where X x (z £ X : H(z) < Hix )} is the set of states with energy lower than x. We set V x oo if X x is empty, 
i.e. when x is a stable state. The set of metastable states X m is defined as 

X m := {x e X : V x = max V z }- (9) 

zex\x s 

We call a nonempty subset CcTa cycle if it is either a singleton or it is a connected set such that 

maxH(x) < HLF(3C)). (10) 

xeC 

A cycle C for which condition (flOb holds is called non-trivial cycle. If C is a non-trivial cycle, we define its depth as 


T(C) :=H(^(3C))-H(^(C)). 


( 11 ) 


Any singleton C = {x} for which condition dlTTb does not hold is called trivial cycle. We set the depth of a trivial 
cycle C to be equal to zero, i.e. T(C) — 0. Given a cycle C, we will refer to the set J-(riC) of minima on its boundary 
as its principal boundary. Note that 



H(x) if C = {x} is a trivial cycle, 

H(J- (dC )) if C is a non-trivial cycle. 


In this way, we have the following alternative expression for the depth of a cycle C, which has the advantage of 
being valid also for trivial cycles: 

T(C) = ®(C,X\C)-H(F(C)). (12) 

The next lemma gives an equivalent characterization of a cycle. 

Lemma 3.1. A nonempty subset C C X is a cycle if and only if it is either a singleton or it is connected and satisfies 


max <J>(x,y) < <£>(C, X \ C). 

x,y€C 


The proof easily follows from definitions Q and (flOb and the fact that if C is not a singleton and is connected, 
then 



We remark that the equivalent characterization of a cycle given in Lemma 13.11 is the "correct definition" of 
a cycle in the case where the transition probabilities are not necessarily Metropolis but satisfy the more general 
Friedlin-Wentzell condition 



where A (x,y) is an appropriate rate function A : X 2 —> 1R + U {co}. The Metropolis transition probabilities corre¬ 
spond to the case (see fl~4l for more details) where 



The next theorem collects well-known results for the asymptotic behavior of the exit time from a cycle as f> 
becomes large, where the depth T(C) of the cycle plays a crucial role. 

Theorem 3.2 (Properties of the exit time from a cycle). Consider a non-trivial cycle C C X. 


(i) For any x e C and for any e > 0, there exists k\ > 0 such that for all f> sufficiently large 
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(ii) For any i€C and for any e > 0, there exists ki > 0 such that for all f sufficiently large 


P pfrtc > ^ (r(C)+£) ) < e~' k2l> . 

(iii) For any x,y £ C, there exists £3 > 0 such that for all f> sufficiently large 

^ > r a * c ) < e-W. 

(iv) There exists £4 > 0 such that for all f> sufficiently large 

supP^X^ £ JP(3C)) < e~ k *P. 


(v) For any x £ C, e > 0 and e' > 0,/or all f> sufficiently large 

P^(Tf c < ^ (r(c)+£) , X r * c £ T(dC)) > e- £ 'P. 


(vi) For any x £ C, any e > 0 and all f sufficiently large 


e mc)-t) < Er x < e mc)+t)' 


The first three properties can be found in [33], Theorem 6.23], the fourth one is [33| Corollary 6.25] and the fifth 
one in l27l Theorem 2.17]. The sixth property is given in [31, Proposition 3.9] and implies that 


UmlbgE^IXC). 


(15) 


The third property states that, given that C is a cycle, for any starting state x £ C, the Markov chain {Xf}f g ]N visits 
any state y £ C before exiting from C with a probability exponentially close to one. This is a crucial property of the 
cycles and in fact can be given as alternative definition, see for instance [91, TO]. The equivalence of the two defini¬ 
tions has been proved in [14] in greater generality for a Markov chain satisfying the Friedlin-Wentzell condition 11 4k 
Leveraging this fact, many properties and results from 0 will be used or cited. 


We denote by C(X) the set of cycles of X. The next lemma, see 135] Proposition 6.8], implies that the set C(X) 
has a tree structure with respect to the inclusion relation, where X is the root and the singletons are the leafs. 

Lemma 3.3 (Cycle tree structure). Two cycles C,C' £ C(X) are either disjoint or comparable for the inclusion relation, 
i.e. C C C' or C C C. 

Lemma 13.31 also implies that the set of cycles to which a state x £ X belongs is totally ordered by inclusion. 
Furthermore, we remark that if two cycles C,C' £ C(X ) are such that C C C', then T(C) < T(C/; this latter 
inequality is strict if and only if the inclusion is strict. 


3.2 Classical bounds in probability for hitting time t x a 

In this subsection we start investigating the first hitting time rf. Thus, we will tacitly assume that the target set A 
is a nonempty subset of X and the initial state x belongs to X \ /\. Moreover, without loss of generality, we will 
henceforth assume that 

A — {y£X\Mco:x^y con A f 0}, (16) 

which means that we add to the original target subset A all the states in X that cannot be reached from x without 
visiting the subset A. Note that this assumption does not change the distribution of the first hitting time rf, since 
the states which we may have added in this way could not have been visited without hitting the original subset A 
first. 

Given a nonempty subset A C X and x £ X, we define the initial cycle C 4 (x) by 

Ca{x) {x} U {z £ X : <h(x,z) < 0(x, A)}. (17) 
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If x £ A, then C A (x) — {x} and thus is a trivial cycle. If x <f A, the subset C A (x) is either a trivial cycle (when 
<b(x, A) — H(x)) or a non-trivial cycle containing x, if <b(x, A) > H(x). In any case, if x yL A, then C A (x) flA-0. 
For every x £ X, we denote by F(x, A) the depth of the initial cycle C A (x), i.e. 

r(x,A) :=r(C A (x)). 

Clearly if C A (x) is trivial (and in particular when x £ A), then F(x, A ) = 0. Note that by definition the quantity 
F(x, A) is always non-negative, and in general 

F(x, A) = <S>{x,A) -H{F{C A (x))) > 4>(x,A) -H(x), 

with equality if and only if x £ J r (C A (x)). 

If x <f A, then the initial cycle C A (x) is, by construction, the maximal cycle (in the sense of inclusion) that 
contains the state x and has an empty intersection with A. Therefore any path co : x —> A has at some point to 
exit from C A (x), by overcoming an energy barrier not smaller than its depth F(x, A). The next proposition gives a 
probabilistic bound for the hitting time zf by looking precisely at this initial ascent up until the boundary of C A (x). 

Proposition 3.4 (Initial-ascent bound). Consider a nonempty subset A C X and x fL A. For any e > 0 there exists k > 0 
such that for f> sufficiently large 

Fp(rf < e m*' A )-^ < e -K/». (18) 

The proof is essentially adopted from l33l and follows easily from Theorem 13.21 11. since by definition of C A (x), 
we have that t* > st r* Q(;c) . 

Before stating an upper bound for the tail probability of the hitting time rf, we need some further definitions. 
Given a nonempty subset B C X, we denote by A4(B) the collection of maximal cycles that partitions B, i.e. 

M(B) := {C £ C(X) : C C B, C maximal}. (19) 

Lemma [3.31 implies that every nonempty subset B C X has a partition into maximal cycles and hence guarantees 
that A4(B) is well defined. Note that if C £ C(X ) is itself a cycle, then Ad(C) = {C}. The importance of the notion 
of initial cycle besides Proposition [3A] is partially explained by the next lemma. 

Lemma 3.5. | |27l Lemma 2.26] Given a nonempty subset A C X, the collection {C A (x)} Ig< y^ A of initial cycles is the 
partition into maximal cycles of X\ A, i.e. 

M{X\A) = {C A {x)} xeXXA . 

We can extend the notion of depth to subsets B C X which are not necessarily cycles by using the partition 
of B into maximal cycles. More precisely, we define the maximum depth T(B) of a nonempty subset B C Las the 
maximum depth of a cycle contained in B, i.e. 

f(B) := max F(C). (20) 

CeM(B) 

Trivially F(C) = T(C) ifC £ C(X). The next lemma gives two equivalent characterizations of the maximum depth 
f (B) of a nonempty subset B CL. 

Lemma 3.6 (Equivalent characterizations of the maximum depth). Given a nonempty subset B CL, 

r(B) = maxT(x, X\B)= max ( min <b(x,y) — H(x)\. (21) 

XGB X^B v. y(=:X\B ' 

In view of Lemma 13.61 T(B) is the maximum initial energy barrier that the process started inside B possibly 
has to overcome to exit from B. As illustrated by the next proposition, one can get a (super-)exponentially small 
upper bound for the tail probability of the hitting time rf, by looking at the maximum depth F (X \ A) of the 
complementary set X \ A, where the process resides before hitting the target subset A. 

Proposition 3.7 (Deepest-cycle bound). (9j Proposition 4.19] Consider a nonempty subset ACL and x £ A. For any 
e > 0 there exists k' > 0 such that for f> sufficiently large 

Fp(r x A >e^ x \ A ^) <e~^. ( 22 ) 

By definition we have T(x, A) < T(X \ A), but in general F(x, A) f Y(X \ A) and neither bound presented 
in this subsection is actually tight, so we will proceed to establish sharper but more involved bounds in the next 
subsection. 
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3.3 Optimal paths and refined bounds in probability for hitting time t x a 

The quantity T(x, A) appearing in Proposition 13.41 only accounts for the energy barrier that has to be overcome 
starting from x, but there is such an energy barrier for every state z (/ A and it may well be that to reach A it is 
inevitable to visit a state z with T(z, A) > T(x, A). Similarly, also the exponent T( X \ A) appearing in Propositi on 13.71 
may not be sharp in general. For instance, the maximum depth T{X \ A) could be determined by a deep cycle C 
in X \ A that cannot be visited before hitting A or that is visited with a vanishing probability as /3 —> oo. In this 
subsection, we refine the bounds given in Propositions l3.4l and l3.7l by using the notion of optimal path and identifying 
the subset of the state space X in which these optimal paths lie. 

Given a nonempty subset A C X and x fL A, define the set of optimal paths 14°^ as the collection of all paths 
a’ 6 Cl X/A along which the maximum energy <5\,, is equal to the communication height between x and A, i.e. 

n°5 := {co e n X:A ■. - <I>(x,A)}. (23) 

Define the relevant cycle C\ (x) as the minimal cycle in C(X) such that C A (x) C C A (x), i.e. 

C+(x) := min{C G C{X) \ C A {x) C C}. (24) 

The cycle Cf (x) is well defined, since the cycles in C(X) that contain x are totally ordered by inclusion, as remarked 
after Lemma [3.31 By construction, Ct (x) f] A f Q) and thus C A (x) contains at least two states, so it has to be a non¬ 
trivial cycle. The minimality of Cf (x) with respect to the inclusion gives that 

max H(z ) = <L(x, A), 

zecj(x) 

and then, by using Lemma l3dl one obtains 

<J>(x, A) < H(X{dC+(x))). (25) 

The choice of the name relevant cycle for Cf (x) comes from the fact that all paths the Markov chain will follow 
to go from x to A will almost surely not exit from C A (x) in the limit /3 —t oo. Indeed, for the relevant cycle C A (x) 
Theorem 13. 21 31) reads 

^ p f( T i <T kiw) =1 - (26) 

The next lemma, which is proved in Section [4j states that an optimal path from x to A is precisely a path from x to 
A that does not exit from Cf(x). 

Lemma 3.8 (Optimal path characterization). Consider a nonempty subset A c X and x £ A. Then 

co G co G Ct X/A and to C C^(x). 

Lemma r3.8l implies that the relevant cycle Cf (x) can be equivalently defined as 

C aM = {y G * I <&(x,y) < <b(x, A)| — |y G A” : <b(x,y) < <J>(x, A) + 5 0 /l}, (27) 

where <$o is the minimum energy gap between an optimal and a non-optimal path from x to A, i.e. 

<5o — 5q{x, A) min <L a , — <L(x, A). 

In view of Lemma [3. 8 1 and d26t . the Markov chain started in x follows in the limit f —> oo almost surely an optimal 
path in 0°^ to hit A. It is then natural to define the following quantities for a nonempty subset A C X and x f A\ 


^min(^/ A) 

:= min maxT(z,A), 
a>en° p j 

(28) 


max maxT(z,A). 

a,en°Pj 

(29) 
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Definition ( f28l> implies that every optimal path co G has to enter at some point a cycle in JvUX "\ A) of depth at 
least T' rrnn (x, A), while definition (f29t means that every cycle visited by any optimal path co G 0°^ has depth less 
than or equal to 'F max (x / A). 

An equivalent characterization for the energy barrier Y max (x, A) can be given, but we first need one further 
definition. Define Ra( x ) as the subset of states which belong to at least one optimal path in fl°^, i.e. 

Ra{x) ■— {y G X | 3a; G G° p ^ : y G co}. (30) 

Note that A D Ra{ x ) 7 ^ 0/ since the endpoint of each path in C1 X/ a belongs to A, by definition <(5jl. In view of 
Lemma [3.81 Ra( x ) Q Cf (x). We remark that this latter inclusion could be strict, since in general R ,-\ (x) ^ C A (x). 
Indeed, there could exist a state y G Cf (x) such that all paths co : x —> y that do not exit from Cf (x) always visit 
the target set A before reaching y, and thus they do not belong to 0°^ (see definitions 101 and (f23b l . see Figured 




(a) The subset Ra(x) (in light gray) (b) The partition into maximal cycles of Ra(x), including 

the initial cycle Ca(x) (in dark gray) 

Figure 2: Example of an energy landscape X with highlighted the subset A (in black), the relevant cycle Ct (x) and 
the subset C^(x) \ (Ra{ x ) U A) (with diagonal mesh) 

The next lemma characterizes the quantity Y max (x, A) as the maximum depth of the subset R y \ (x) \ A (see 
definition (f 20 ll ). 

Lemma 3.9 (Equivalent characterization of T r max (x, A)). 

^max(^/ A) — T(Ra(x) \ A). (31) 

Using the two quantities T r rn i n (x, A) and T max (x, A), we can better control in probability the hitting time rj, as 
stated in the next proposition, which is proved in Section [4] 

Proposition 3.10 (Optimal paths depth bounds). Consider a nonempty subset A C X and x G X \ A. For any e > 0 
there exists k > 0 such that for f> sufficiently large 

P^( t a < < e~ K P, (32) 

and 

Fpirf > x(*^)+e)j < (33) 

This proposition is in fact a sharper result than Propositions 13. 41 and !3. 71 since 

r(x. A) < Y min (x, A) < Y max (x, A) < f(X \ A). (34) 

Indeed, since the starting state x trivially belongs to every path in Q°^, we have that T(x, A) < max zea , T(z. A) for 
every co G and thus T(x, A) < Y m j n (x, A). Furthermore, since by definition C^(x) \ A C X\A, Lemma [3.91 
yields that T max (x, A) < T(A’\ A). 

It follows from (f34b that, if F(x, A) = Y(X \ A), then T m j n (x, A) = T r max(^/0)- However, in general, the 
exponents Y m j n (x, A) and 1 F max (x, A) are not equal and may not be sharp either, as illustrated in the fictitious 
energy landscape in Figure [3] 
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(a) Energy profile of the energy landscape with the initial cycle Ca (x) (in grey) and the 
relevant cycle Cf (x) (below the dashed black line) 


» o o o o o 



(b) Partition into maximal cycles of X \ A for the same energy landscape 
Figure 3: An example energy landscape for which Y max (x, A) is not sharp 


In this example, there are two paths to go from x to A: The path co which goes from x to y and then follows 
the solid path until A, and the path a/, which goes from x to y and then follows the dashed path through z and 
eventually hitting A. Note that Ty, = — <E>(x, A ), so both co and co' are optimal paths from x to A. By inspection, 

we get that Y max (x, A) — T(z, A). However, the path a/ does not exit the cycle Ca (y) passing by its principal 
boundary and, in view of Theorem l3.2I iv). it becomes less likely than the other path as [1 —> co. In fact, the transition 
from x to A is likely to occur on a smaller time-scale than suggested by the upper bounds in Proposition l3.10l and in 
particular the exponent Y max (x, A) is not sharp in this example. 

In the next subsection, we will show that a more precise control in probability of the hitting time Tf is possible, 
at the expense of a more involved analysis of the energy landscape. 

3.4 Sharp bounds for hitting time r\ using typical paths 

As illustrated at the end of the previous subsection, the exponents Y mjn (x, A) and Y max (x, A) which control in prob¬ 
ability the hitting time t{ may not be sharp in general. In this subsection we obtain exponents that are potentially 
sharper than Y min (x, A) and Y max (x, A) by looking in more detail at the cycle decomposition of Cjr (x) \ A and by 
identifying inside it the tube of typical paths from x to A. In particular, we focus on how the process moves from two 
maximal cycles in the partition of (x) \ A and determine which of these transitions between maximal cycles are 
the most likely ones. 

Some further definitions are needed. We introduce the notions of cycle-path and a way of mapping every path 
co G Q t/ /i into a cycle-path C w . Recall that for a nonempty subset A C X, d A is its external boundary and IF (A) is 
its bottom, i.e. the set of the minima of the energy function H in A. A cycle-path is a finite sequence (Q,..., C m ) of 
(trivial and non-trivial) cycles C\, ..., C m G C{X) such that 

Q n C /+1 — 0 and 3Q Hi C i+ j f 0, for every i — 1,.. .,m — 1. 

It can be easily proved that, in a cycle-path (C\, ..., C m ), if C, is a non-trivial cycle for some i — 1,..., m, then its 
predecessor Q_i and successor C; + i (if any) are trivial cycles, see fl3l Lemma 2.5]. We can consider the collection 
V X/ a of cycle-paths that lead from x to A and consist of maximal cycles in X "\ A only, namely 

F x ,a ■— {cycle-path (Ci,..., C m ) \ C\,.. ., C,„ G M {X \ A), x G C 1r dC m DA^0}. (35) 
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We define a mapping Cl x A — > V x / \ by assigning to a path co — (co\,... ,u’ n ) £ Cl xA the cycle-path C w — 
(Ci,...,C m ( a ,j) £ V X/ a as follows. Set to = 1, Ci = Ca(x), and then define recursively 

ti min{ft: > f,_ 1 | ^ C/} and Q +1 C^(cUf i ). 

The path a; is a finite sequence and co n £ A, so there exists an index m () £ N such that cot,^ = co n £ A and 
there the procedure stops. The way the sequence (Ci,..., C m ^) is constructed shows that it is indeed a cycle-path. 
Moreover, by using the notion of initial cycle C^(-) to define Ci,..., C tn ( wj , they are automatically maximal cycles 
in A4(X \ A). Lastly, the fact that ai £ CI X/ a implies that x £ C\ and that dC m t w \ n A 0, hence C w £ V X/ a and 
the mapping is well-defined. We remark that this mapping is not injective, since two different paths in C1 XJ 4 can 
be mapped into the same cycle-path in V X/ a- In fact, a single cycle-path groups together all the paths that visit the 
same cycles (the same number of times and in the same order). Cycle-paths are the correct mesoscopic objects to 
investigate while studying the transition x —> A: Indeed one neglects in this way the microscopic dynamics of the 
process and focuses only on the relevant mesoscopic transitions from one maximal cycle to another. 

Furthermore, we note that for a given path co £ Ci x a, the maximum energy barrier along co is the maximum 
depth in its corresponding cycle-path C w , i.e. 


maxT(z, A) — max T(C). 

zew ceCc 

We say that a cycle-path (Q,..., C m ) is connected via typical jumps to A or simply vtj -connected to A if 

iF{dCj) fl Q+i 0, for every i — 1,.. .,m — 1, and tF{dC m ) n A ^ 0. 

The next lemma, presented in |14|, guarantees that there always exists a cycle-path from the initial cycle Ca (x) that 
is vtj-connected to A for any nonempty target subset A C X and x A. 

Lemma 3.11. | Til Proposition 3.22] For any nonempty subset A C X and x A, there exists a cycle-path C* — 
(Ci,..., C m *) vtj-connected to A with x £ C\ and C \,..., C* n C X \ A. 

By inspecting the proof of H4l Proposition 3.22], one notices that the given cycle-path C* — (Ci,..., C m *) consists 
only of maximal cycles in X \ A, i.e. C \,..., C m - £ A4 (X \ A), and in particular Ci = C/\ (x). Hence C* £ V Xi a and 
therefore the collection V X/ a is not empty. 

We define co £ C1 X/ a to be a typical path from x to A if its corresponding cycle-path C w is vtj-connected to A, and 
we denote by the collection of all typical paths from x to A, i.e. 

Q x \ {<^ G Ci X/ A | co is typical}. 

The existence of a vtj-connected cycle-path C* — (Q,..., C m *) E V X/ a guarantees that 

Indeed, take 1/0 = x, yi £ ^(dQ) Cl C,-+ 1 , i — 1,..., m* — 1 and y m * £ ) fl A and consider a path co* that visits 

precisely the saddles y 0 ,.. ■ ,y m * in this order and stays in cycle C ; between the visit to y\~\ and y L . Then co* is a 
typical path from i' to A. The next lemma gives an equivalent characterization for a typical path from i' to A. 

Lemma 3.12 (Equivalent characterization of a typical path). Consider a nonempty subset A C X and x fL A. Then 

co £ co £ Cl X/A and 0(cu !+ i, A) < <b(a>;. A) i = 1,... ,\co\. 

The proof of this result is presented in SectionH] In particular. Lemma 13. 12l shows that every typical path from x 
to A is an optimal path from x to A, i.e. 

<= 06 ) 

since if co £ then <!>(a;;. A) < <b(aq. A) — 0(x, A) for every i — 2,.. .,\co\ and thus <£> 0 , — 0(x, A). 

Let T^(x) be the tube of typical paths from x to A, which is defined as 

Ta(x) -.= {yex | : y£co}. (37) 
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In other words, T^(x) is the subset of states y G X that can be reached from x by means of a typical path which 
does not enter A before visiting y. The endpoint of every path in belongs to A, thus T a (x) fl A ^ 0. Since 
by (f36b every typical path is an optimal path, it follows from definitions (l30l) and (f37l> that 

Ta(*) C R a (x). 

The tube of typical paths can be visualized as the standard cascade emerging from state x and reaching eventually 
A, in the sense that it is the part of the energy landscape that would be wet if a water source is placed at x and the 
water would "find its way" until the sink, that is subset A. This standard cascade possibly consists of basins/lakes 
(non-trivial cycles), saddle points (trivial cycles) and waterfalls (trivial cycles). From definition (f37il . it follows that 
if z £ T^(x), then 

T^(z) CT A (x). (38) 



Figure 4: Example of an energy landscape with the tube of typical T a{%) highlighted in gray 


Denote by Tyi(x) the collection of cycles C £ M. (<T \ A) for which there exists a vtj-connected cycle-path 
Ci,..., C n C X \ A with Ci = Ca(x) and C„ — C, i.e. 

Tyi(x) :={CeM(T\A) | 3 Ci,..., C„ vtj-connected cycle-path with Q = Ca(x) and C n = C}. 

Note that the cycles in ‘Ia(x) form the partition into maximal cycles of (x) \ A, i.e. 

'Z a (.x) = M(J a {x)\A), 


and that, by construction, there exists C £ T a(x) such that T(f)C) D A ^ 0. The boundary of Ta ( x ) consists of 
either states in A or of states which belong to the non-principal part of the boundary of a cycle C G T 'a(x), that is 
dC \ J-(dC). In other words, 

dT A (x)\A= U (dC \J r (dC)). (39) 

cei A (x) 

The typical paths in are the only ones with non-vanishing probability of being visited by the Markov chain 
{Xfjfgjvj started in x before hitting A in the limit j3 —t oo, as illustrated by the next lemma which is proved in 
Section 111 


Lemma 3.13 (Exit from the typical tube T^(x)). Consider a nonempty subset A C X and x ^ A. Then there exists k > 0 
such that for f> sufficiently large 


F p{ T dT A ( X ) ^ T a) < e K/3 - 

Given a nonempty subset A C X and x (f_ A, define the following quantities: 


©min (x, A) min maxr(z,A), 

_„vtj zea> 


(40) 


and 


0max(hd) max maxT(z,A). 


(41) 





In other words, definition fiOll means that every typical path co G has to enter at some point a cycle of depth 

at least 0 m in (x, A). On the other hand, definition <l29li implies that all cycles visited by any typical path co G 
have depth less than or equal to 0 ma x(^, A). Hence, © max (x, A) can equivalently be characterized as the maximum 
depth (see definition ll20ll l of the tube (x) of typical paths from x to A, as stated by the next lemma. 

Lemma 3.14 (Equivalent characterization of 0 m ax(*/ A)). 

0max(x,A) = f(T A (x)\A) - max T(C). (42) 

Since by (f36b every typical path from x to A is an optimal path from x to A, definitions ll28l l. (129b . J40b and (PlTb 
imply that 

Ymin (x,A) < 0 min (x, A) ^ ©max (x,A)< 

^max(^/ A). (43) 

We now have all the ingredients needed to formulate the first refined result for the hitting time x\, which is proved 
in Section [4] The next proposition yields a control in probability for the hitting time of by looking at the shallozvest- 
typical gorge inside T^(x) that the process has to overcome to reach A and at the deepest-typical gorge inside T^(x) 
where the process has a non-vanishing probability to be trapped before hitting A. 

Proposition 3.15 (Typical-cycles bounds). Consider a nonempty subset A C X and x ^ A. For any e > 0 there exists 
k > 0 such that for f sufficiently large 


Fpl 



| < e~*l, 

(44) 

Pp( 

A 


) < e~ K P. 

(45) 


The proof, which is a refinement of that of Proposition l3.10l is presented in Section|U 

In general, the exponents 0 m m(x, A) and 0 m ax (x, A) may not be equal, as illustrated in the fictitious energy 
landscape in Figure [5] 



(a) Energy profile of the energy landscape with the initial cycle Ca(x) (in grey) and the 
relevant cycle Cf ( x ) (below the dashed black line) 
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(b) Partition into maximal cycles of X \ A for the same energy landscape 
Figure 5: An example energy landscape for which 0 m i n (x, A) < 0 m ax (x, A) 

Also in this example, there are two paths to go from x to A: The path co which goes from x to y and then 
follows the solid path until A, and the path co ', which goes from x to y and then follows the dashed path through 
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z and eventually hitting A. Both paths co and a/ always move from a cycle to the next one visiting the principal 
boundary, hence they are both typical paths from x to A. By inspection, we get that 0 max (x, A) — T(z, A), since 
the typical path co' visits the cycle C,\ (z). Using the path co we deduce that 0 min (x, A) — T(y, A) and therefore 
©min(*/^) < 0 max(^/ A). 

If the two exponents 0 m m(x, A) and 0 m ax(x, X) coincide, then Proposition 13.151 gives a sharp control in proba¬ 
bility for the hitting time rjj, as stated in the next corollary. 

Corollary 3.16. Consider a nonempty subset A C X and x ^ A. Assume that 

®min(^/^) — 0(x, A) — 0 max (x, A). (46) 

Then, for any e > 0 

lim Pfl ( e /5(0(^A)-e) < T x < e fS(0(x,A)+e)\ = L (47) 

jS—>oo ^ V / 

There are many examples of models and pairs (x. A) for which 0 m i n (x, A) — 0 max (x, A). The most classical 
ones are the models that exhibit a metastable behavior. If one takes x £ X m and A = X s , then it follows that 
0 m j n (x, A) = V x — ©max(x. A) (recall the definition |8j of stability level) and Corollary 13.161 holds, see also [27’ 
Theorem 4.1]. 

3.5 First moment convergence 

We now turn our attention to the asymptotic behavior of the mean hitting time Erjj as /3 —> oo. In particular, 
we will show that it scales (almost) exponentially in fi and we will identify the corresponding exponent. There 
may be some sub-exponential pre-factors, but, without further assumptions, one can only hope to get results on a 
logarithmic scale, due to the potential complexity of the energy landscape. We remark that a precise knowledge of 
the tube of typical paths is not always necessary to derive the asymptotic order of magnitude of the mean hitting 
time Eijj, as illustrated by Proposition [3T8] 

To prove the convergence of the quantity ^ log ErJ, we need the following assumption. 

Assumption A (Absence of deep typical cycles) Given the energy landscape (X, H, q), zve assume 
(Al) 0 m i n (x, A) = 0(x,A) = 0 max (x,A), and 

(A2) ©max(z. A) < &(x, A) for every z G X \ A. 

Condition (Al) says that every path co : x —> A visits one of the deepest typical cycles of the tube T^(x). 
Condition (A2) guarantees that by starting in another state z x, the deepest typical cycle the process can enter 
is not deeper than those in T^ (x). Checking the validity of Assumption A can be very difficult in general, but we 
give a sufficient condition in Proposition l3.18l which is satisfied in many models of interest, including the hard-core 
model on rectangular lattices presented in Subsection 12.21 which will be revisited in Section [5] We further remark 
that (Al) is precisely the assumption of Corollarv l3.16l Therefore, in the scenarios where Assumption A holds, we 
also have the asymptotic result d47b in probability for the hitting time rf. 

The next theorem says that if Assumption A is satisfied, then the asymptotic order-of-magnitude of the mean 
hitting time Ei^ as fi —> oo is 0(x, A). 

Theorem 3.17 (First moment convergence). If Assumption A is satisfied, then 

1 

lim — logE-rf = 0(x, A). 

(i->oo p 

In many models of interest, calculating T{X \ A) is easier than calculating 0 m ; n (x, A) or 0 max (x, A). Indeed, even 
if t(X \ A) is a quantity that still requires a global analysis of the energy landscape, one needs to compute just 
the communication height <3>(z. A) between any state z E X \ A and the target set A, without requiring a full 
understanding of the complex cycle structure of the energy landscape. Besides this fact, the main motivation to 
look at the quantity T{X \ A) is that it allows to give a sufficient condition for Assumption A, as illustrated in the 
following proposition. 

Proposition 3.18 (Absence of deep cycles). If 

<F(x, A) - H(x) = f {X \ A), (48) 

then Assumption A holds. 
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Proof. From the inequality 


®{x,A) -H(x) < 0 min (x,A) < ©max(x,A) < T(X \ A), 

we deduce that 0 miri (x, A) = 0 m ax(^, A) and (Al) is proved. Moreover, by definition of T(X \ A), we have 
0max(z, A) < T(X \ A) for every z G X \A. This inequality, together with the fact that 0 m ax(*/ A) = T(X \ A), 
proves that (A2) also holds and thus assumption A is satisfied. □ □ 

We now present two interesting scenarios for which J48l l holds. 

Example 1 (metastability scenario) 

Suppose that 

x G X m and A = X s . 

In this first scenario, rf is the classical transition time between a metastable state and a stable state, a widely studied 
object in the statistical mechanics literature (see, e.g. |[27f). Assumption A is satisfied in this case by applying Propo- 
sition I 3 TT 8 I since condition d48l> holds: The equality <T>(x, X s ) — H(x) = T(X \ X s ) follows from the assumption 
x G X m , which means that there are no cycles in X \ X s that are deeper than Cx s (x). 

Example 2 (tunneling scenario) 

Suppose that x G X s , A = X s \ {x} and 

<3>(z. A) — H(z) < <3>(x,A) — H(x) VzGT\{i}. (49) 

In the second scenario, the hitting time rjj is the tunneling time between any pair of stable states. Assumption d49t 
says that every cycle in the energy landscape which does not contain a stable state has depth strictly smaller than 
the cycle C^(x) and we generally refer to this property as absence of deep cycles. This condition immediately im¬ 
plies that d48b holds, i.e. T (X \A)— <E>(x, A) — H(x), and hence in this scenario assumption A holds, thanks to 
Proposition 13.181 

The hard-core model on grids introduced in Subsection l2.2l falls precisely in this second scenario and, by proving 
the validity of Assumption A, we will get both the probability bounds d47t and the first-moment convergence for 
the tunneling time . 

3.6 Asymptotic exponentiality 

We now present a sufficient condition for the scaled random variable to converge in distribution to an 

exponential unit mean random variable as f —» 00. Define 

©*(x. A) lim -^logEr^. (50) 

/5 —>co p 

If Assumption A holds, then we know that 0(x, A) = 0*(x, A), but the result presented in this section does not 
require the exact knowledge of 0* (x. A). We prove asymptotic exponentiality of the scaled hitting time under the 
following assumption. 

Assumption B ("Worst initial state") Given an energy landscape (X, H, q), we assume that 

0*(x, A) > T{X \ (A U {x})). (51) 


This assumption guarantees that the following "recurrence" result holds: From any state z G X the Markov 
chain reaches the set A U {x} on a time scale strictly smaller than that at which the transition x —» A occurs. Indeed, 
Proposition 13.71 gives that for any e > 0 

lim supPflfrf 1 ,,, > = 0. 

We can informally say that Assumption B requires x to be the "worst initial state" for the Markov chain when the 
target subset is A. 

Proposition 13.201 gives a sufficient condition for Assumption B to hold, which is satisfied in many models of 
interest, in particular in the hard-core model on grid graphs described in Subsection l2.2l 
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Theorem 3.19 (Asymptotic exponentiality). If Assumption B is satisfied, then 



A Exp(l), 


f> —> oo. 


( 52 ) 


More precisely, there exist two functions ki(/3) and ^((3) with lim^_ > , 00 fci(/5) = 0 and lim^oo ^(/S) 
s > 0 



<fc 1 (/5)e-( 1 - fc2 ^) s . 


0 such that for any 


The proof, presented in Section[U readily follows from the consequences of Assumption B discussed above and 
by applying [151 Theorem 2.3], 

We now present a condition which guarantees that Assumption B holds and show that it holds in two scenarios 
similar to those described in the previous subsection. 


Proposition 3.20 ("The initial cycle Ca (x) is the unique deepest cycle"). If 


T{x,A) > r(*\(AU {*})), 


(53) 


then Assumption B is satisfied. 

The proof of this proposition is immediate from ( f34l> and (|43t ) . We remark that if condition ( T53b holds, then the 
initial cycle C,\ (x) is the unique deepest cycle in A’ \ Condition d53T) is stronger than (151 1 , but often easier to 
check, since one does not need to compute the exact value of 0* (x, A), but only the depth T(x,A) of the initial cycle 
Ca (x) . We now present two scenarios of interest. 


Example 3 (unique metastable state scenario) 

Suppose that 

X m — {z}, A — X s , and x E Ca(z). 

We remark that this scenario is a special case of the metastable scenario presented in Example 1 in Subsection 13.51 
This scenario was already mentioned in 1271 . in the discussion following Theorem 4.15, but we briefly discuss here 
how to prove asymptotic exponentiality within our framework. Indeed, we have that 


r(x,x s ) = r(Cxs( z )) = r(x\x s ), 


thanks to the fact that z is the configuration in X \ X s with the maximum stability level, which means that Cx»(z) 
is the deepest cycle in X \ X s . Moreover, the fact that z is the unique metastable state, implies that 

f(X \ X s ) > f(X \ (X s U {z})). 


since every configuration in X \ ( X s U {z}) has stability level strictly smaller than V z - 


Example 4 (two stable states scenario) 

Suppose that 


X s = {si,s 2 }, A — {S 2 }, x E C A (si) and r(,T \ {si,s 2 }) < <b(si,s 2 ) — H(si). 

This scenario is a special case of the tunneling scenario presented in Example 2 in Subsection 13.51 In this case 
condition d53b is obviously satisfied. In particular, it shows that the scaled tunneling time rl 1 between two stable 
states in X is asymptotically exponential whenever X s — {si,S 2 } and the condition T (X \ {si,S 2 }) < ( bfs 1 ,S 2 ) — 
H(si) is satisfied. 

In Section [5] we will show that for the hard-core model on grids Assumption B holds, being precisely in this 
scenario, and obtain in this way the asymptotic exponentiality of the tunneling time between the two unique stable 
states. 
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3.7 An example of non-exponentiality 

Assumption B is a rather strong assumption. In fact, for many models and for most of choices of x and A, the scaled 
hitting time does not have an exponential distribution in the limit f —> oo. Moreover, we do not claim that 

Assumption B is necessary to have asymptotically exponentiality of the scaled hitting time t^/Et^. However, we 
will now show that for the hard-core model on K-partite graphs Assumption B does not hold and that the model 
exhibits non-exponentially distributed scaled hitting times. 

Take A to be a complete K-partite graph. This means that the sites in A can be partitioned into K disjoint sets 
£>i,..., Bk called components, such that two sites are connected by an edge if and only if they belong to different 
components, see Figure [ 6 (a)| 



(a) K-partite graph A with K = 5 



(b) State space X corresponding to the graph A 


Figure 6 : Exampe of a K-partite graph A and of the resulting energy landscape for the hard-core model on A 

This choice for A results in a simpler state space X , for which a detailed analysis is possible. Moreover, for the 
same model the asymptotic behavior of the first hitting times between maximal-occupancy configurations is already 
well understood, see l38l . Before stating the results, we need some further definitions. Let L/ c be the size of the k-th 
component B^, for k — 1,..., K. Clearly the total number of sites in A is N = Y^=i ^k- Define L m ax := maxj c =i,...,k L/t- 
For k = 1,,K define the configuration cy A A as 




1 if v e B^, 

0 otherwise. 


The configurations {w-\ ,..., ct k } are all the local minima of the energy function H on the state space X. Moreover 
( 7 /t is a stable state if and only if L/ ; = L max . In addition, denote by 0 the configuration in X where all the sites 
are empty, i.e. the configuration such that 0(f) = 0 for every v 6 A. Given k\, Ay A {1,- ■ -,K}, k\ f Ay, we 
take ( 7 /tj and j^ 2 as starting and target configurations, respectively. Define L* = L* (/C 2 ) maxty^ L/ c and let 
K, — K t (k 2 ) ■— \k A "2 : L/ c = L,} be the set of indices of the components of size L t different from /C 2 - 

In 1381 the same model has been considered, but in continuous time; the results therein (Theorems IV.l and IV.2) 
can be translated to discrete time as follows. Given two functions /(/3) and y(/3), we write / ~ g as /3 —t 00 when 
lim = 1 . 


Proposition 3.21 (First moment convergence of the hitting time r^ 1 ). For any k ^,Ay G {1 ,,K} with k\ 7 ^ fc 2 , the 




first hitting time satisfies 


TC a h 
et ^ 7 


N 


1 {k 1 eK*} \K* 


^k 2 




f —r 00 . 


In particular, 


lim \ log IE r ^ 1 
f 6 ak 2 


L^. 
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Proposition 3.22 (Asymptotic distribution of the hitting time T^ 1 ). Take k\, A 2 G {1,..., K} such 
thatk\ 7 ^ A 2 . lfk\ G X*, then 


-\4exp(1), /3->oo. 
Et * 1 


Instead, ifk\ £ X*, t/zezz 




Er, 




4z, 


% 


£ -> oo. 


za/zere Z = £f=i Y,- and (Y;);>i zzre i.i.d. exponential unit mean random variables and M is an independent random variable 
with geometric distribution P(M = n) — (1 — p) n pfor n G N U {0} with p — Lkl 


|.K*|L*+Lfc 9 


As illustrated in Figure | 6 (b)( the energy landscape consists of X cycles, one for each component of A, and one 
trivial cycle {0} which links all the others. The depth of each of the cycles is equal to the size of the corresponding 
component of A. All the paths from ay to cy must at some point exit from the cycle corresponding to component 
k \, at whose bottom lies ay. After hitting the configuration 0, they can go directly into the target cycle, i.e. the one 
at which bottom lies cy 2 , or they may fall in one of the other X — 1 cycles. Formalizing these simple considerations, 
we can prove the following proposition. 


Proposition 3.23 (Structural properties of the energy landscape). For any k\, Ay G {!,..., X}, k\ ^ Ay, 


{fTfcj}) Efcj 1 f r min( cr J: 1 /{G/c 2 }), 

and 

YmaxK, HJ) = U= ¥(X \ WJ). 

In particular, if k\ K„ (A 2 ), then it follows from Propositions 13. 21 1 and 13.231 that 

L * = = 0 K' £ r(Af \ {cr kl ,cr k2 }) = L*. 


Hence Assumption B is not satisfied for the the pair (ay, {ay}). Indeed there exists another configuration cy, for 
some A 7 G X* (fc 2 ), A 7 7 ^ A|, for which the recurrence probability 


P* 


T > 


e ^( L q+ £ ) 


does not vanish as /3 —>• 00 , since component £y has size L* > Ly. As illustrated in Proposition 13.221 the scaled 

cty (Tfc 

hitting time r^VEr ^ 1 does not converge in distribution to an exponential random variable with unit mean as 
j 6 —» 00. 


3.8 Mixing time and spectral gap 

In this subsection we focus on the long-run behavior of the Metropolis Markov chain {xf} f c;N and in particular 
examine the rate of convergence to the stationary distribution. We measure the rate of convergence in terms of the 
total variation distance and the mixing time, which describes the time required for the distance to stationarity to 
become small. More precisely, for every 0 < e < 1, we define the mixing time f™ lx (e) by 

fg“(e) min{zz > 0 | max ||Pg(x, •) - fz«(-)|| TV < e}, 
r xeX e 

where ||v — v 7 ||xy := \ YlxeX \ v ( x ) ~ v '( x ) I f° r an Y two probability distributions v,v' on X. Another classical notion 
to investigate the speed of convergence of Markov chains is the spectral gap, which is defined as 

1 ( 2 ) 

Pp :=1 ~ay, 

where 1 = > • • • > ajj > —1 are the eigenvalues of the matrix (Pp(x,y)) Xi y G x- The spectral gap can be 

equivalently defined using the Dirichlet form associated with the pair ( Pp, pp), see [26} Lemma 13.12]. The problem 
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of studying the convergence rate towards stationarity for a Friedlin-Wentzell Markov chain has already been studied 
in mmmm. In particular, in 0 the authors characterize the order of magnitude of both its mixing time and 
spectral gap in terms of certain "critical depths" of the energy landscape associated with the Friedlin-Wentzell 
Markov chain. We summarize the results in the context of Metropolis Markov chains in the next proposition. 

Proposition 3.24 (Mixing time and spectral gap for Metropolis Markov chains). For any 0 < e < 1 and any s £ X s , 

^ozt T {e)=nX\{s}) = hm -^logpp. (54) 

Furthermore, there exist two positive constants 0 < ci < C 2 < °o independent of f such that 

Cie -m*\w) < H < C 2 e -m*\w) v /3 > o. (55) 

4 Proof of results for general Metropolis Markov chain 

In this section we prove the results presented in Section [3] for a Metropolis Markov chain {x|’} fe [\j with energy 
landscape ( X, H, q) and inverse temperature f. For compactness, we will suppress the implicit dependence on the 
parameter f> in the notation. 

4.1 Proof of Lemma [3781 

If co £ n° p ], then trivially co £ Fl xA . Moreover, we claim that to £ implies co C Cf (x). Indeed, by definition 
of an optimal path and inequality J25b . it follows that an optimal path cannot exit from C~t (x) since 

<S> u = <S>(x,A)<H(r(dCZ(x))). 

The reverse implication follows from the minimality of C^(x), which guarantees that 4>(x, A) — max, gC i ^ H(z). 

□ 


4.2 Proof of Proposition 13.101 

We first prove the lower bound (f32l> and, in the second part of the proof, the upper bound (f33b . 


Consider the event {vf < eXX-ninCX) e)| fj rs t. There are two possible scenarios: Either the process exits from 
the cycle Cf ( x) before hitting A or not. Hence, 

P/3 (ta < e^ Ymin ^ x ' A ^~ £ ^ — 


= P 


< P 


Jr* < x < x \ I p / T x < T x < e P('¥ mjn (x,A)-£)\ 

T A < - e ' t A ^ T dC+(x)J P{ 9C+(x) ~ T A ) 

( -r x <" phF^mm(x,A) — e) X / X \ _t_ p ( J-X / ph('¥ m j n (x,A) — e)\ 

H A , A 3C+(x)) + U Z 3 V dc+(x) ^ e )■ 


(56) 


The quantity P^( T gc+( x ) < eZ’HminXH) E )j is exponentially small in f for f sufficiently large, thanks to Theo- 

rem I3.2H ) and to the fact that Y m ; n (x, A ) < V(Cf (x)). In order to derive an upper bound for the first term in the 
right-hand side of ( f56l I . we introduce the following set 

Zopt := {z£R a (x)\A I r(z,A) > Y min (x,A)}. 


By definition (f28l> of T r mm (x, A), every optimal path co £ must inevitably visit a cycle of depth not smaller than 
1 f r m i ri (x, A) and therefore it has to enter the subset Z opi before hitting A. Hence, for every z £ Z opt , conditioning on 
the event {tT < r* +, ., X T * = z}, we can write 

Z Q pf 


t\=t f + r 


A’ 
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and, in particular, ijj > st x\. Using this fact, we get that there exists some U > 0 such that for j6 sufficiently large 

P „ f t-'' < e P(fmmM)-e) r X <r X \ 

>( T J < ^w) p f (’i < I Tj < T« CJ(I) ) 

? />(t 5 < ’fciw) E Pf( t 5 < ^ |T - l ‘' 4) - ) I rS < y cjw , X T . = 2 )p„(: 

z62 0 pt F 


= F f 
< P, 


X T * = z 

2 opt 


<p, 


fa < ■&}(,>) E p fl (4<^ lT * ( " ,) - ) )p^K = 2 ) 

ze2 opt F 

£ fa < <ij w ) E ^fa £ ^ r( "»->)p ? (x^ = z) 


Z G -2opt 


<Vfi(T* A <Tlc 1(x) ) L F ^lc A (z)< emZ ' A) £) ) P ^( X rl opt 


Z G -2^>pt 


< P 


ffaWcJw) E; z-W-P^XtJ =z)=P^(tJ 

zGZopt 


= Z 


< ufa+, 
3 Uh 


• e 


-hfa 


< e~ hli , 


(57) 


where we used Theorem 13.21 0 and the facts that t\ > T gc A ( r ) an d th at r(C^(z)) — T{z,A) > A) for every 

Z G Ufipt;. 


For the upper bound, we can argue that 


P 


M t a 


> e /3(T max (x,A)+ £ )j = 



— pjr X a > e^HmaxUvlj+e) X <T X ) 

1L p[ L A ^ e ' l a ^ L dC+(x)J 

+ P >( 

Jl > e> 

< P o(t x a > fafamaxfa'fa+e) T A < T x 4. 1 

A R P\ l A ^ e ' ‘a ^ l dC%(x)) 

+ P >( 

JdC+(x) 


' T ac+(x) - t a } 


< t) 


The second term is exponentially small in j6 thanks to Theorem 13. 21 iii) applied to the cycle C A (x), to which both x 
and at least one state of A belong. We now turn our attention to the first term. 

If the Markov chain {Xf } fg jsj hits the target set A before exiting from the cycle CJ(x), then it has been following an 
optimal path and, in particular, before hitting A it can have visited only states in the set /fa fa) \ A. Consider a state 
z G R fa) \ A. By definition of R fa), z can be reached from x by means of an optimal path, i.e. there exists a path 
co* : z —> x such that Ty,* < <L>fa, A). This fact implies that <3>(z, A) < <l>(x, A) and thus for every path in a; G fi ° P A , 
we can obtain a path that belongs to by concatenating co* and a’. Hence, 


Y max (z ,A) <Y max (x,A). 


(58) 


Lemma 15 . 11 1 guarantees the existence of a cycle-path C\, ... ,C n vtj-connected to A such that z G Cj and Ci, ... C n G 
M.(X\A). From the fact that this cycle-path is vtj-connected and Lemma |3. 1 21 it follows that H (J- (3Q)) < <L>fa, A). 
Definition ( f29l l , inclusion d36t and inequality 1581 imply that 


r(Q) < Y max (x,A), i = l,...,n. 


For every i — 2 ,...,n take a state j/ ; - G -F(dC,_i) n Q. Furthermore, takei/i = z and y n+ \ G J-(dC n ) n A. Consider 
the set of paths 

£e,z,A ■ ^t,z,A (j/l/ h[, 1/2/1-2, • • • / Vrif C J; , Cf n +\ ) 

consisting of the paths constructed by the concatenation of any H-tuple of paths ah 1 ah 2 -',..., <fa"i satisfying the 
following conditions: 

(1) The path ofa) has length |o;(')| < efa'Lnaxfafa+Ufa f or an y j _ _ >n - 

(2) The path ofa) joins y L to y !+1 , i.e. ofa) G fiy lV y !+1 , for any i — 1 ,..., n; 
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(3) All the states co^ belong to Q for any j — l,..., |o/ ! )| — 1, for any i — 1 

We stress that the first condition restricts the set £ ErZrA to paths that spend less than e^(' F max(*A)+e/4) q me j n C y C ] e Q, 
for every! = 1 Note that the length of any path a; G £ £/Zi a satisfies the upper bound \co\ < \X\e^^ m:iX ^ x ' A ' ,+£ ^ 4: \ 
Moreover, since the state space X is finite, for /I sufficiently large 

\U)\ < |^|e^(Ym«(*^)+e/4) < e /3(T max (x,A)+£/2) Va; £ £ ^ a 
Therefore, for every z G Ra (x) \ A 

F^rl < > Ppfa < e^^ x ' A ^ /2 \(X m ) r } =1 G £ £ , Z , A ) = Pp((X m )J =1 G £ £ , Z , A ). 

Using the Markov property, we obtain that for any e' > 0 and j6 sufficiently large 

Pf}((Xm)% = 1 e ^,z,a) - flP^fe < ^-(^4) x y, = ym ) > 

V ' 1=1 V ' T 3C; 7 

where the second last inequality follows from Thcorem l3.21 v). Since e~P £ ' l I does not depend on the initial state z, 

inf Pa (t~ a < > e -m_ 

z€Ra(x)\A ti \ A - >- 


Applying iteratively the Markov property at the times rnaxUA ! ft /2i, w qp k — 1,. e^ 2 , we obtain that 

< ( sup Pg(> ^(^M)+ £ /2))] eP£/2 


P 


pfa > e^Ohna *(x,A)+e) /T x < T* c+(x J < ( sup P^(r| > e^mM^+e/Z)^ 

A Z£ R a( x )\A 


c (l-e-^WY < e 


_ e ^e/2-e’\X\) 


We remark that we can take the supremum over the states in R A (xj \ A, since all the other states in C A (x) \ R A (x) 
cannot be reached by means of an optimal path (i.e. without exiting from C A (x)) before visiting the target subset A. 
Choosing e' > 0 small enough and 1 6 sufficiently large, we get that e e P(tn £ 1*1) < e kp £ or 

any k > 0. □ 


4.3 Proof of Lemma r3.12l 

Take a path co G Cl XfA and consider the corresponding cycle-path C w — (C \,..., C„ ;(a ,)). We will show that 

co & ^xa 3i '■ 4>(o; i+1 ,A) > 4>(a;i, A). 

If co £ n v x \, then the cycle-path C w — (Ci,..., C m ( aij ) is not vtj-connected to A, i.e. there exists an index 1 < k < 
m(co) such that dC k H C k+ \ ^ 0, but kF(dC k ) H C k+ 1 — 0. Take the corresponding index i in the path co such that 
coi G C/ c and u>i+\ G dC k H Qt+i- Since C0j + 1 ^ J-(dC k ), 


<S>(,co i+1/ A) > H(co i+1 ) > H(T(dC k )) = <S>(co ir A), 

where the last equality follows from the fact that Q, = C’a (fu,). 

For the converse implication, suppose the path co contains two consecutive states co/ and to l+ \ such that <J>(a', + i, A) > 
<&(cOj, A). Consider the index k such that a’, G C k . The states a;, and a>,_| must be contained in two different cycles, 
otherwise if — C k — C^(o; ;+1 ), one would have 4>(o;; +1 , A) = A), which is a contradiction. Assume 

then that (Oj G C k for some k, and thus oy_i G c)C k IT C/ c+] by definition of a cycle-path. The state a; I+] cannot belong 
to J-{dC k ), since otherwise 


®(co i+1 ,A) < 0(^(3C fc ),A) - H{T(dC k )) = *(&7i,A), 
where we have used the fact that C k — C A (cOi). Hence cOj + \ ^ kF{dC k ) and thus co £ 


□ 
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4.4 Proof of Lemma [3.131 

In d39l> we have used the fact that the only way to exit from the tube (x) without having hit the subset A first is 
to exit from the non-principal boundary of a cycle C £ ‘X/i (x). Therefore 


P M t ot aW < t a) = 


E p «( 


cei A (x 


T-b < T A, X r x e C, X T x ^ ,F(3C) ) 

A' t 3t .(x) 1 ST a (x) ^ v V 


= E E p o (^ aM < x Tj[ , = 2 , X t , ^oo) 

Ces: A (x)zeC M ' 

< £ |C|supP j8 (x ifc 0 7-(3C)) < E |C|e-^<e"^ / 


Cex A (*) zee 


Cex A (x) 


for some k > 0 and /5 sufficiently large. The second last inequality follows from Theorem I3.21 iv). Thanks to the 
definition ([37l> of the typical tube, — T /\ j — 0, since all the states of the target state A that can be hit 

starting from x by means of a typical path belong to T^(x) and not to t)T A (x). □ 


4.5 Proof of Proposition 13.151 

As mentioned in Subection l3.41 this proposition is a refinement of Proposition [3TTDJ so instead of giving a full proof, 
we will just describe the necessary modifications. 

We first prove d44l> . Consider the event { < e £(©min (x,A) e)| fj rs j xhere are two possible scenarios: Either the 

process exits the tube T^(x) of typical paths before hitting A or it stays in T A (x) until it hits A. Hence, 

P^( T A < e ^( 0 min(r,A)-£)^ 

= p fi( T J < (•/’I®—< Tj y (i() ) + p p( 2 fr,w < Xa < 

< P f (tJ < Tj < T 8 y w ) + Pf, (t^ ( i) < Tj). (59) 

Lemma [3d3]implies that the second term in the right-hand side of < f59b is exponentially small in /3. In order to derive 
an upper bound for the first term in ((59), we introduce the set 

Zvtj := {z £ T a{x)\A I r(z,A) > 0 min (x,A)}. 

By definition (l40t of 0 m i n (x, A), every typical path to £ O'^ must inevitably visit a cycle of depth not smaller than 
©min ( x > A) and therefore has to enter the subset Z vt} before hitting A. Hence, for every z £ Z vt j, conditioning on the 
event {tT < r; v T , v X r x — zj, we can write 

ai al^xj z vt j 

T x A. T X I z 

r A ~ T z T T a , 

and in particular we have that t a > s t t|. Using this fact and arguing like in ll 57) . we can prove that there exists 
k > 0 such that /I sufficiently large such that 

< e /K<W^)- £)/Tl < < e -V 

We now turn our attention to the proof of the upper bound ((45). First note that 

P/i( T A > — 

= Vfifa > e /K<W*,A) +£ ) /T * < ^ (x) ) + P^( T * > e ^(0 m aJ ^) +£)/T ^ w < ijj 

< P^tJ > ^(®-x(,,a ) +£)/T5 < t * TaW ) +P / 3 (t- T aW < ri). 


(60) 


where the the latter term is exponentially small in j6 for f> sufficiently large, thanks to Lemma 13.131 For the first 
term in ||60) , we refine the argument given in the second part of the proof of Proposition 13.101 Consider a state 
z £ T^(x) \ A. Since T^(z) C T^(x), it follows from d42t that 


0max(z, A) < 0 

max (x,A). 


(61) 
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Thanks to Lemma [3.1 11 there exists a cycle-path of maximal cycles C\,... ,C n C in X \ A that is vtj-connected to A 
and such that z G Q. The definition of vtj-connected cycle-path. Lemma [3.141 and inequality (16 1 l l imply that 

T(Q) < 0 max (x,A), Vz = l,...,n. (62) 

For each i — 2 ,...,n, take a state y L G .F(3Q_i) n Q. Furthermore, take y\ — z and y n +i G T(dC n ) D A. We 
consider the collection of paths 


£ t,z,A &tz,A (y 1/ c i/3/2/ Q/ • ■ • /y«/ C„,y„ + i), 




which consists of all paths obtained by concatenating any n-tuple of paths aco 
ing conditions: 

(1) The path a/') has length |o/') | < e^( 0 max(*M)+ E /4), for any i — 1 

(2) The path joins y L to i//+i, i.e. am) G n,j iAJi+u for any i — 1,..., n; 

(3) All the states ay 1 belong to Q for any / = |a>W | — 1, for any i — 1,. 


LO' n '> satisfying the follow- 


n. 


This collection is similar to the collection £ e z a described in the proof of Proposition 13.101 but condition (1) here is 
stronger. Using (l62l l and arguing as in the proof of Proposition 13.101 we obtain that 

> P j6 ((X m )2 =1 G £* >ZiA ) > 


Since e P £ 1*1 does not depend on the initial state z, we get for any P > 0 and /3 sufficiently large 

inf Fr(t a < ^(©max(x,A)+ £ /2)\ > e -pef\X\ 
z€T A (x) p \ A > 


and thus 


sup Fp(r\ > e^( 0 max(x,A)+e/2)'\'\ 
zeT A (x)\A V J J 


p p (tS > < r s y w ) < ( 

\-n 


o^n 


(63) 


by applying iteratively the Markov property at the times ke^® max '- x,A l ! ~ £ / 2 ), with k — 1,..., e^ 2 . Choosing c 1 > 0 
small enough and f sufficiently large, we get that the right-hand side of inequality (l63t is super-exponentially small 
in f>, which completes the proof of the upper bound ( l45l i. □ 


4.6 Proof of Theorem 13.171 

Since Assumption (Al) holds, we set 0(x, A) — 0 mir ,(x, A) = 0 max (x, A). The starting point of the proof is the 
following technical lemma. 

Lemma 4.1 (Uniform integrability). If Assumption (A2) holds, then for any e > 0 the variables Y A (fS) := 

are uniformly integrable, i.e. there exists fio > 0 such that for any S > 0 there exists K G (0, oo) such that for any ft > fio 

P (’' / K/ 5 )1{y^(^)>k}) < ^ 

Proof. The proof is similar to that of f27l Corollary 3.5]. It suffices to have exponential control of the tail of the 
random variable Yf(fi) for f sufficiently large, i.e. 

P^YJOS) > n) - P^(rle-^ (0 ( x ' A)+£) > n) < a n , 

with a < 1. Assumption (A2) implies that 0 max (z, A) < 0(x, A) for every z G X\A. Then, iteratively using the 
Markov property gives 

fJtX > ne -pmx,A)+e)\ < ( supPfifr- > e^mx,A)+e)\\ n < ( sup p» f rf > e «0-M)+ E )]]' ! ( 

and the conclusion follows from Proposition l3.15l □ □ 
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Proposition l3.15l implies that the random variable Y X A (/3) := T^e £(®(*A) +e) conver g es to 0 in probability as j3 —> 
oo. Lemma l4Tl guarantees that the sequence (Yj!j(/3))is also uniformly integrable and thus lim^^ E| Y^(jS) | = 
0. Therefore, for any e > 0 we have that for /! sufficiently large Erjj < g/H©(*^)+ £ ). f ar as the lower bound is 
concerned, for any e > 0 Proposition [3T5] and the identity 0(x, A) — 0 mm (x, A) yield 

Erjt > e l 8 (®(*'- A )- £ / 2 )p^ T ^ > e 0( 0 (*A)-e/2)j > e 0( 0 (*A)- £ / 2 )(i - e~ K P) > e P(®(*<^)- £ ). 

Since £ is arbitrary, the conclusion follows. □ 


4.7 Proof of Theorem 13.191 


As mentioned before, the strategy is to show that the Markov chain { X t } t cjc satisfies the assumptions of [18. The¬ 
orem 2.3], which for completeness we reproduce here. For R > 0 and r £ (0,1), we say that the pair (x, A) with 
A C X satisfies Rec(R, r) if 

sup P/3 (t Z { XrA} >R)<r. 

The quantities R and r are called recurrence time and recurrence error, respectively. 

Theorem 4.2. l H8l Theorem 2.3] Consider a nonempty subset A C X and x £ A such that Rec(R(/5), r(/5)) holds and 

(i) lim^oo R(/S)/EtJ(/3) = 0, 

(ii) lim^^oo r(l 3) = 0. 

Then there exist two functions k\ (f>) and k^fC) with lim^oo k\ (fS) — 0 and lim^oo k 2 (fi) — 0 such that for any s > 0 


P f 


(JH 


> s — e 


< k^fte-t 1 -^^. 


(64) 


Since T(X \ (A U {x})) < 0(x, A) by assumption, we can take £ > 0 small enough such that T{X \(AU {x})) + 

£ < 0(x, A). Proposition l3.7l implies that there exists k > 0 such that the pair (x, A) satisfies Rec(e^ ; ' :, \( AU ( x }P +£ ), e~ K P) 
for l 6 sufficiently large, since 

_- e K/S 


supP^r^j > u{*}))+e)) < e - 


ZdX 


Clearly r(/3) = e~ e —> 0 as /3 —> oo and thus assumption (ii) holds. Assumption (i) is also satisfied, since 
lim \ log R{fS) — T{X \(AU {x})) + £ < 0(x, A) — lim \ logErY 

^->00 p /i->oo p 


□ 


4.8 Proof of Proposition 13.241 

The two limits in ( f54b are an almost immediate consequence of [9} Theorem 5.1] and [.55] Proposition 2.1]. Indeed, 
we just need to show that the critical depths H 2 and H 3 (see below for their definitions) that appear in these two 
results are equal to T{X \ {s}), for any s £ X s . The critical depth H 2 is equal to T(X \ {s}) by definition, see (9J. 
Note that this quantity is well defined, since its value is independent of the choice of s, as stated in (9] Theorem 5.1]. 
This critical depth is also known in the literature as maximal internal resistance of the state space X , see 11571 Remark 
4.4], 

The definition of the critical depth H 3 is more involved and we need some further notation. Consider the two- 
dimensional Markov chain (Xf,Yf) f >g, where Xf and Yt are two independent Metropolis Markov chains on the 
same energy landscape (X, H, q) and indexed by the same inverse temperature j 6 . In other words, (Xf, Yf) f >o is the 
Markov chain on X x X with transition probabilities P ® 2 given by 

P® 2 ((*,y), (w,z)) = Pp(x,w)Pp{y,z) V (x,y), (w,z) £ A" 2 . 

The critical depth H 3 is then defined as 

H 3 :—f(X x X \ D), 
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where D {(x,x) \ x £ X}. Consider the null-cost graph on the set of stable states, i.e. the directed graph (V,E) 
with vertex set V — X s and edge set 


£ = |( S/ s') e X s x X s 


lim —— 

^3—^00 j6 


\ogPp(s,s') = o}. 


[9] Theorem 5.1] guarantees that ££ < H 3 and states that if the null-cost graph has an aperiodic component, then 
H 2 — H 3 . We claim that this condition is always satisfied by a Metropolis Markov chain with energy landscape 
(X, H, q) with a non-constant energy function H. It is enough to show that for any such a Markov chain there exists 
at least one stable state s G X s such that 

The subset X \ X s is a non-empty set, since H is non-constant. Since q is irreducible, there exists a state s £ X s 
and x G X \ X s such that q(s, x) > 0. Furthermore, we can choose s 6 X s and x G X \ X s such that the difference 
H(x) — H(s) is minimal. For this stable state s, the transition probability towards itself reads 


Pp(s,s) = 1 - £ q(s,y)e-M H M- H W + 

yH 

= 1- £ £ ,( S ,y) e -KH<y>-«wr 

s'ex s ,s'^s yeX\X s 

>1- E E «(».y) 

s'eX s ,s'^s yeX\X s 

>1- X] q(s,s')-e-K H W~ H W + . 
s'ex s ,s'^s 


Since q is a stochastic matrix, it follows that 1 — Yj S '<sX s , s'^s q{ s > s ') > 0 independently of /5 and thus 


lim -- log Pp{s,s) - 0 , 

p r 


since for every e > 0 there exists fio such that Pp(s,s) > 1 — YL s 'eX s ,s'^s q{ s > s ') ~ e ^ H ^ > e ^ for j 6 > /3o- 

Finally, the bounds <l55l l follow immediately from [22- Theorem 2.1], since the quantity m which appears there is 
equal to T (X \ {s}) thanks to Lemma [3761 □ 


5 Energy landscape analysis for hard-core grid models 

This section is devoted to the analysis of the energy landscapes corresponding to the hard-core dynamics on the 
three different types of grids presented in Section|2] Starting from geometrical and combinatorial properties of the 
admissible configurations, we prove some structural properties of the energy landscapes for Xj KL , Xq kl and Xq k f . 
These results are precisely the model-dependent characteristics that are needed to exploit the general framework 
developed in Section|3]to obtain the main results for the hard-core model on grids presented in Subsection l2.3l These 
structural properties are stated in the next three theorems and the rest of this section is devoted to their proofs. 

Theorem 5.1 (Structural properties of Xj K[ ). Let A be the Kx L toric grid T^l- Then 

(a) r(T’\{e, o}) < min{fC, £}, 

(b) r(e, {o}) - min{fC, L} + 1 = f(X \ {o}). 

Theorem 15.11 implies that conditions ll48l l and d53t hold for the pair (e, {o}) in the energy landscape ( Xj KL , H, q). 
Hence Assumptions A and B are satisfied and the statements of Theorems 12. ll and l2.2l for a toroidal grid follow 
from Corollary 13.161 and Theorems 13 .1 71 and 13.191 respectively. 

Theorem 5.2 (Structural properties of Xq kl ). Let A be a K x L open grid Gk,l■ IfKL = 0 (mod 2), then 

(a) f (X \ {e, o}) < min{ \K/2], \L/ 2] }, 

(b) r(e, {o}) - min{ \K/ 2], \L/2]} + 1 = f (X \ {o}). 
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If instead KL = 1 (mod 2), then 

(a*) f(*\{e,o}) <min{\K/2\,\L/2\}, 

(b*) r(e, {o}) = min{ \K/2], [L/2] } + 1 = f (A” \ {o}) and T(o, {e}) = min{ \K/2], [L/2] } = T(,T \ {e}). 

We remark that in the case KL = 1 (mod 2), inequality (a*) is strict, while inequality (a) is not, and this fact is 
crucial in order to conclude that o is the unique metastable state of the state space Xq k when KL = 1 (mod 2). Us¬ 
ing Theorem l5.2l we can check that the pair (e, {o}) satisfies both Assumptions A and B (since both conditions d48l) 
and (f53l) hold) and thus prove the asymptotic properties for the hitting times and r° illustrated in the statements 
of Theorems [2d] and |Z2] for an open grid Gk,l- 

Theorem 5.3 (Structural properties of Xq kl ). Let Abe a K x L cylindrical grid Ck,l■ Then 

(a) f(A’\{e,o}) < min{K/2 / L}, 

(b) T(e, {o}) - mm{K/2, L} + 1 = f (X \ {o}). 

Using Theorem 15.31 we can check that Assumptions A and B are satisfied by the pair (e, {o}), and then the 
statements of Theorems l2.1l and l2.2l for a cylindrical grid Ck,l follow from Coro 11 ary 13.161 and Theorems l3.17l and l3.19l 
The ideas behind the proofs of these three theorems are similar, but for clarity we present them separately in 
Subsections 15.1115.21 and 15.31 


Denote T(A) T(X \ {e}), where ( X, H, q) is the energy landscape corresponding to the hard-core model on 
the grid A. In the case A — Gk,l with KL = 1 (mod 2), Theorem 15.21 gives that T(A) ~ min{ | K/2\, [L/2]}. In all 
the other cases by symmetry we have r(fb\{e}) — T(X \ { o}) and hence, from Theorems 15.1 1 fh2l a nd 15.31 it then 
follows that 


{ minjfC, L} + 1 
min{ \K/2], [L/2]} + 1 
min{[K/2], [L/2]} 
min{fC/2, L} + 1 


if A = T K/L , 

if A — G K/L and KL = 0 (mod 2), 
if A = Gk,l an d KL = 1 (mod 2), 
ifA = C K , L . 


Besides appearing in the two main theorems (Theorems 12.11 and 12.211 . the exponent T(A) also characterizes the 
asymptotic order of magnitude of the mixing time f™ x (e. A) and of the spectral gap pp( A) of the hard-core dynam¬ 
ics {Xf}f e ]N on A (see Subsection 13.811 . as illustrated in the next theorem. 


Theorem 5.4 (Mixing time and spectral gap). For any rectangular grid A and for any 0 < e < 1, 


h m 4 los '”“ (£ ' A) = r(A) = ,! i £r? k w (A) ' 


Furthermore, there exist two constants C\, C 2 independent of f such that 

Cie -m A ) < < C 2 e -b r ( A ). 

The proof readily follows from the properties of the energy landscapes illustrated in Theorems 15.1115.21 and 15.31 
and by applying Proposition l3.24l 


We next introduce some notation and definitions for grid graphs. Recall that A is a K x L grid graph with 
K, L > 2 which has N — KL sites in total. We define energy wastage of a configuration cr G X on the grid graph A as 
the difference between its energy and the energy of the configuration e, i.e. 

U(cr) := H(a) - H(e). (65) 

Since H(e) = —\N/2], we have that 

U{cr) = H(a) + \N/2] - \N/ 2] - *(*)■ 

veA 

Moreover, since e is a stable state, U(a) > 0. The function U : X -a ]R + U {0} is usually called virtual energy in the 
literature m chi and satisfies the following identity 

1 

ir(cr) = - lim -logger), 

^->oo p r 
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where up is the Gibbs measure ((3]) of the Markov chain {Xf} 


We denote by cj, j — 0,. .., L — 1, the y'-th column of A, i.e. the collection of sites whose horizontal coordinate is 
equal to y, and by r u i = 0,..., K — 1, the z-th row of A, i.e. the collection of sites whose vertical coordinate is equal 
to i. In addition, define the i-th horizontal stripe, with z = , |_X/2J , as 

S,- r 2 i~2 U r 2 i- 

and the j-th vertical stripe, with j —1,..., [L/2\ as 


Q C2j-2 U C 2 j-\- 

An important feature of the energy wastage U for grid graphs, is that it can be seen as the sum of the energy 
wastages on each row (or on each horizontal stripe). More precisely, let L lj(cr) be the energy wastage of a configura¬ 
tion a G X in the z-th row, i.e. 

Ui(<r) := \L/2] - £ <r(v). (66) 

ver. 

Similarly, let L if ( cr ) be the energy wastage of a configuration ct G X on the z-th horizontal stripe, i.e. 

Uf(cr) := L - £ a(v) - U 2; _ 2 (cr) + U 2i ^(a). (67) 

veSj 

Then, we can rewrite the energy wastage of a configuration cr 6 X as 

K \K/2] 

U{a) = Y J U i {a)= £ Uf{a). (68) 

i=l i =1 


Given two configurations cr, a' G X and a subset of sites W C A, we write 

cr |w = < 7 |w o-(v) = (/(v) WveW. 

We say that a configuration a G X has a vertical odd (even) bridge if there exists a column in which configuration a 
perfectly agrees with o (respectively e), i.e. if there exists an index o <;< L — 1 such that 

% = op. (respectively <J| C . = ep.). 

We define horizontal odd and even bridges in an analogous way and we say that a configuration cr £ X has an odd 
(even) cross if it has both vertical and horizontal odd (even) bridges. 



Figure 7: Examples of configurations on the 8x8 toric grid displaying odd bridges or crosses 

We remark that the structure of the grid graph A and the hard-core constraints prohibit the existence of two 
perpendicular bridges of different parity, e.g. a vertical odd bridge and a horizontal even bridge. Bridges and 
crosses are the geometric feature of the configurations which will be crucial in the following subsections to prove 
Theorems 15.1115.21 and 15.31 
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5.1 Energy landscape analysis for toric grids (Proof of Theorem 15.11 

This subsection is devoted to the proof of Theorem 15.11 valid for a toric grid Tk,l- Without loss of generality, we 
assume henceforth that K < L. Recall that by construction of the toric grid A, both K and L are even integers. In the 
remainder of the section we will write X instead of Xj K L to keep the notation light. 

We first introduce a reduction algorithm, which is used to construct a specific path in X from any given state in 
X \ (e, o} to the subset {e, o} and to show that 

f(X\{e,o})<K, (69) 

which proves Theorem 15.ll al. Afterwards, we show in Proposition l5.6l that 

<D(e,o) -H(e) >K + 1, 

by giving lower bounds on the energy wastage along every path e —t o. The reduction algorithm is then used again 
in Proposition l5.7l to build a reference path to* : e —> o which shows that the lower bound is sharp and hence 

<3>(e, o) — H(e) — K + l, 

which, together with (69) , proves Theorem 15.1 l b). 

The starting point of the energy landscape analysis is a very simple observation: A configuration in X has 
zero energy wastage in a given row (column) if and only if it has an odd or even horizontal (vertical) bridge. The 
following lemma formalizes this property. We give the statement and the proof only for rows, since those for 
columns are analogous. 

Lemma 5.5 (Energy efficient rows are bridges). For any a G X and any i — 0,... , K — 1, 

Ui(cr) = 0 <=+> <7| r . = e| r . or a jr . = 0 | r .. 

Proof. The z’-th row of the toric grid graph A is a cycle graph with L/2 even sites and L/2 odd sites. If a\ r . — ey. 
or a> r . — 0 | r ., then trivially there are L/2 occupied sites and hence Uf a) = 0. Noticing that the configurations ey. 
and oy. on row i correspond to the only two maximum independent sets of the cycle graph r, proves the converse 
implication. □ □ 

Reduction algorithm for toric grids 

We now describe an iterative procedure which builds a path to in X from an initial configuration // (with specific 
properties, see below) to state o. We call it reduction algorithm, because along the path it creates the even clusters are 
gradually reduced and they eventually disappear, since the final configuration is o. 

The algorithm cannot be initialized in all configurations a 6 X \ {o}. Indeed, we require that the initial config¬ 
uration a is such that there are no particles in the even sites of the first vertical stripe C\, i.e. 

E *<?) = (70) 

v g Ci n v e 

This technical assumption is required because the algorithm needs "some room" to start working, as will become 
clear later. 

The path to is the concatenation of L paths to (1 ),... ,co'- L K Path a ;u) goes from cy to cy + i, where we set a\ = a 
and we define recursively configuration aj + i from configuration cy as follows. 


Cj+ i ( p ) 


{ aj{v) if v G A \ (cj U cy+i), 
o(v) if v G Cj, 
aj(v) if v e Cj+i n v 0 , 

o if v e Cj + 1 n v e . 


Clearly, due to the periodic boundary conditions, the column index should be taken modulo L. It can be checked 
that indeed ty _. | = o. We now describe in detail how to construct each of the paths to^' 1 for j — 1,..., L. We build a 
path tu^i — ((Oq\co^\ .. -,C0 ]P + i) of length K + l (but possibly with void moves), with to^ — aj and to^ +1 — cy +1 . 
We start from configuration ow'°) = cy and we repeat iteratively the following procedure for all i — 0,..., K — 1: 
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• If z = 0 (mod 2), consider the even site v — (j + 1, i + (/ + 1 (mod 2))). 

- If cv^\v) — 0, we set oz^ — oz^ and thus H{ oz^) = H(oz-^). 

- If lo^\v) — 1, then we remove from configuration coj 2> the particle in v increasing the energy by 1 and 

obtaining in this way configuration oz^, which is such that — H(co^) + 1. 

• If /’ = 1 (mod 2), consider the the odd site v — ( j,i — 1 + (j + 1 (mod 2))). 

- If az^(zz) — 1, we set ozj^ = oz^* and thus F^oz^) — H(ozp). 

- If az-^ (v) — 0, then we add to configuration oo'p a particle in site v decreasing the energy by 1. We obtain 

in this way a configuration which is admissible because by construction all the first neighboring 

sites of u are unoccupied. In particular, the particle at its right (i.e. that at the site v + (1,0)) may have 
been removed exactly at the previous step. The new configuration has energy — H(oz ; - ) — 1. 

Note that for the last path all the moves corresponding to even values of i are void (there are no particles in the 
even sites of Co). The way the path oz^ is constructed shows that for every j — 1,..., L, 

H(a j+1 ) < H(o-j), 

since the number of particles added in (the odd sites of) column Cj is greater than or equal to the number of particles 
removed in (the even sites of) column Cj + \. Moreover, 

- H ( a j) + 1 

since along the path oz^ every particle removal (if any) is always followed by a particle addition. These two 
properties imply that the path oz : cr —> o created by concatenating oz ! " 1 j,..., uz'b satisfies 

< H(tr) + 1. 

Proof of Theorem 15. H al. It is enough to show that for every a 6 X \ {e, o} 

O(<r,o) -H(o-) < K, 

since inequality lf69l l then follows the equivalent characterization of T given in Lemma 13.61 To prove such an inequal¬ 
ity, we have to exhibit for every cr 6 X \ {e, o} a path oz : a —> o in X such that <f> a , = max^ ea , H(y) < H(ir) + K. We 
construct such a path oz as the concatenation of two shorter paths, to f 1 ) and U’ 1 ' 2 -, where cb 1 J : cr —t cr’, co^ 2 ' 1 : cr’ —> o 
and cr' is a suitable configuration which depends on cr (see definition below). 

Since cr e by assumption, the configuration cr must have a vertical stripe with strictly less than K even occupied 
sites. Without loss of generality (modulo a cyclic rotation of column labels) we can assume that this vertical stripe 
is the first one, C\, and we define 

b-= £ cr(v)<K- 1. (71) 

veCinv e 

Define cr' as the configuration that differs from cr only in the even sites of the first vertical stripe, i.e. 

cr'( v ).= J^) if ■o e A \ (Q n v e ), 

1 ' |o iive QnVe. 

The path oz^ 1 ) — (ozj 1 a zjw), with oz[^ = cr and — cr' can be constructed as follows. For i — 1,..., b, in 
step i we remove from configuration ur 1 the first particle in C| f'l V,, in lexicographic order obtaining in this way 
configuration ozj|q, increasing the energy by 1. Therefore the configuration cr’ is such that H(cr') — H(<r) — b and 

^o;W = ma xH(rj) < H(<r) + b. 

tjecvW 
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The path u/ 2 ^ : u' —> o is then constructed by means of the reduction algorithm described earlier, choosing cr' as 
initial configuration and o as target configuration. The reduction algorithm guarantees that 

$ a,(2) = max H(j/) <H{cr') + l. 
rjeco'. 2 ) 

The concatenation of the two paths on 1 ' and co 2 gives a path co : or -0 o which satisfies the inequality Ty, < 
H(cr) + b + 1 and therefore 

ED 

<&(cr,o) — H(a) < b+ 1 < K. □ 

Proposition 5.6 (Lower bound for <l>(e, o)). 


<J>(e,o)-H(e) >K + 1. 

Proof. We need to show that in every path co : e —> o, there is at least one configuration with energy wastage greater 
than or equal to K + 1. Take a path co — (uq, ..., co n ) G Q e ,o- Since e has no odd bridge and o does, at some point 
along the path co there must be a configuration co m * which is the first to display an odd bridge, horizontal or vertical, 
or both simultaneously In symbols 

m* min{m < n | 3i : (o; m )| r . = 0 | r . or 3; : (co m )| c = 0 | c }. 

Clearly m* > 2. We claim that LI((U m *-i) > K + 1 or U(co m *_ 2 ) > L + 1. We distinguish the following three cases: 

(a) co m * displays an odd vertical bridge only; 

(b) co m * displays an odd horizontal bridge only; 

(c) co m * displays an odd cross. 

These three cases cover all the possibilities, since the addition of a single particle cannot create more than one bridge 
in each direction. 

For case (a), let v* G A be the unique site where configurations oo m *~\ and co m * differ and assume that v* G q* 
for some 0 < i* < K — 1. By construction, v* must be an odd site and a> m i _ | (zh) ~ 0 and co,„> (v*) — 1. This 
means that two neighboring sites at the left and at the right of v* must be unoccupied for both configurations, in 
particular co m *_i(v* + (1,0)) = 0 and \(v* + ( — 1,0)) — 0, otherwise the addition of a particle in v* would 
not be allowed. Moreover, there must be another odd unoccupied site in the same r,*, otherwise the addition of a 
particle in v* would create also an odd horizontal bridge, which we assumed co m * does not have. Having at least 
two unoccupied sites both in r t * n V e and in r ; * n V 0 implies that 

Ui*(co m *-i) > 2. (72) 

Moreover we claim that the energy wastage in every row which does not contain site v* is also greater than or equal 
to 1. Indeed configuration to m *_\ cannot display any odd row (by definition of in*) and neither an even row, since 
co m *_i(v* + (1,0)) = 0 and cu m *_ \(v* + ( — 1,0)) = 0. Therefore for every i — 0,...,K — 1 and i f i* we have 
(co m *)\ r . f 0 | r , e|. and hence U, (to m ») > 1 by Lemma 1531 This, together with inequality {72}, implies 

K—l 

U(a>m‘-l) — Y2 ^i( w rn*) > K + 1- 

i=0 

For case (b) we can argue as in case (a), but interchanging the role of rows and columns, and obtain that 

Lf(o; m *_i) G L +1 ^ K. ~b 1. 

For case (c), the vertical and horizontal odd bridges that co m * has must necessarily meet in the odd site v*. Having 
an odd cross, co m * cannot have any horizontal or vertical even bridge. Consider the previous configuration cu m *_\ 
along the path co, which can be obtained from co m * by removing the particle in v*. From these considerations and 
from the definition of m* it follows that has no vertical bridge (neither odd or even) and thus, by Lemma[531 

it has energy wastage at least one in every column, which amounts to 

LI(co m *—\) > L. 
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If there is at least one column in which has energy wastage strictly greater than one, we get 

U(cv m *—i) > L + 1, 

and the claim is proved. Consider now the other scenario, in which the configuration has energy wastage 

exactly one in every column, which means U(to m *_i) = L. Consider its predecessor in the path to, namely the 
configuration to m *-2 ■ We claim that 

U{to m *- 2 ) — L + 1 . 

By construction, configuration to m *_2 must differ in exactly one site from a7 m *_\ and therefore 

U(co m *~ 2 ) — U(co m *_ 1 ) ± 1 . 

Consider the case where U(oo m *_ 2 ) — U(co m *~i) — 1 — L — 1. In this case the configuration to m *-2 must have 
a zero-energy-wastage column and by Lemma 15.51 it would be a vertical bridge. If it was an odd vertical bridge, 
the definition of m* would be violated. If it was an even vertical bridge, it would be impossible to obtain the odd 
horizontal bridge (which to m * has) in just two single-site updates, since three is the minimum number of single-site 
updates needed. Therefore 

U(to m *— 2 ) — U(to m *—i) + 1 — L + 1. Cl 

□ 


Proposition 5.7 (Reference path). There exists a path to* : e —> o in Xj K L such that 


- H(e) = K + 1. 


Proof. We construct such a path a;* as the concatenation of two shorter paths, to' 1; and o/ 2: , where 00 ! " 1 ' : e —> cr* 
and : cr* —> o, and prove that $ M (i) — H(cr*) — H(cr) + K and that $ u ( 2 ) = H(cr*) + 1 are satisfied, so that 
Ty,» — max ?ea ,t TI(ij) — H(e) + K + 1 as desired. The reason why to is best described as the concatenation of two 
shorter paths is the following: The reduction algorithm cannot in general be started directly from e and the path a/ 1 ) 
indeed leads from e to cr*, which is a suitable configuration to initialize the reduction algorithm. The configuration 
cr* differs from e only in the even sites of the first vertical stripe: 


(7*(v) 


e(v) if i; G A \ Ci, 
0 if i; e Ci. 


The path o;^ 1 ) — (to^\.. .,to^,f), with co^ — e and to^h canbe constructed as follows. Forz = 1,...,K, at 
step i we remove from configuration 07 ; - the first particle in Ci (~l V e in lexicographic order, increasing the energy 
by 1 and obtaining in this way configuration 07 ; -H. Therefore the configuration a’ is such that H(cr*) — H(e) = K 
and (i) — H(e) + K. The second path to l2> : cr* — » o is then constructed by means of the reduction algorithm, 
which can be used since the configuration cr* satisfies condition (f70b and hence is a suitable initial configuration for 
the algorithm. The algorithm guarantees that < l > a; ( 2 ) = H(a* ) + 1 and thus the conclusion follows. □ □ 


5.2 Energy landscape analysis for open grids (Proof of Theorem 15.21 1 

We now prove Theorem l5.2l valid for an open grid Gk,l- Also in this case, we assume without loss of generality that 
K < L. Recall that K and L are positive integers, not necessarily even as in the previous subsection. In the remainder 
of the section we will write X instead of Xq k l . 

We first introduce a modification of the previous reduction algorithm tailored for open grids. The scope of this 
reduction algorithm is twofold. It is used first to build a specific path in X from any given state in X \ {e, 0 } to the 
subset (e, 0 } and to prove that if KL = 0 (mod 2), then 

f(*\{e,o})< \K/2], (73) 

which is Theorem 15.2f a ). The same argument also shows that if KL = 1 (mod 2), then 

f(*\{e,o}) < \K/2], (74) 
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and also Theorem 15.21 3*) is proved. By giving a lower bound on the energy wastage along every path e —> o, we 
show in Proposition [h9] that 

0>(e,o)-H(e) > \K/2] +1. 

Then, using again the reduction algorithm for open grids, we construct a reference path co* : e —>■ o which proves 
that the lower bound above is sharp and hence 

<b(e,o)-H(e) = \K/2] +1. (75) 

In the special case KL = 1 (mod 2), since Ofo, e) — <J>(e, o) and H(o) — H(e) + 1, we can easily derive from the 
last equality that 

<l>(o,e) — H(o) — [K/2], (76) 

Lastly, we combine inequality (f73l> and equation d75l> to obtain 

f(*\M) = \K/2\+l, 

which concludes the proof of Theorem l5.21 bl. In the special case KL = 1 (mod 2), inequality (f74b and equation ( f76b 
prove Theorem l5.21 b*l. since they yield that 

f(^\{e})= \K/1\. 

We need one additional definition: Say that a configuration in X displays an odd (even) vertical double bridge if there 
exists at least one vertical stripe S, in which configuration J perfectly agrees with o (respectively e), i.e. if there 
exists an index 1 < j < [L/2J such that 

a \Cj = °\Cj (respectively C7j c . = e| C .). 

An odd (even) horizontal double bridge is defined analogously. The two types of double bridges are illustrated in 
Figure |U 

Observe that an admissible configuration on the open grid has zero energy wastage in a horizontal (vertical) 
stripe if and only if it has an odd or even horizontal (vertical) bridge in that stripe. The next lemma formalizes 
this property. We give the statement and the proof only for horizontal stripes, since those for vertical stripes are 
analogous. In the special case of an open grid where KL = 1 (mod 2), the topmost row and the leftmost column 
need special treatment, since they do not belong to any stripe. The second part of the following lemma shows 
that an admissible configuration has zero energy wastage in that row/column if and only if they agree perfectly 
with e therein. Again we will state and prove the result for the topmost row, the result for the leftmost column is 
analogous. 




(a) Odd horizontal double bridge (b) Odd vertical double bridges 

Figure 8: Examples of configurations on the 8x8 open grid displaying an odd double bridge 


Lemma 5.8 (Energy efficient stripes are double bridges). Consider a configuration a G X. 
(a) For any i — 0,..., [K/2j — 1, the energy wastage Uf(cr) in horizontal stripe S, satisfies 

Uf (cr) — 0 <*=>• <T\ S . = e\ s . or o) s . = 0 | Sj . 
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(b) If additionally KL = 1 (mod 2), then the energy wastage in the topmost row ( cr ) satisfies 


U K -ip) - 0 


a VK -1 e kx-i‘ 


Proof, (a) Consider the rectangular 2 x L grid graph induced by the horizontal stripe S,: It has L even sites and L 

odd sites. If trig. = ejg. or cr| S . = 0 | S ., trivially Llf (cr) — 0. Let us prove the converse implication. Denote by (ef) 

the number of particles present in even sites in the top (bottom) row of stripe S,. Analogously, define ot (of) as the 
number of particles present in odd sites in the top (bottom) row of stripe S, . We will show that: 

(i) Uf(cr ) = 0 and e t + ej, — 0 -t=>- o t + oj, — L; 

(ii) Uf(cr) — 0 and e t + ej, > 0 -t=>- o t + Of, = L. 

Cases (i) and (ii <=) are immediate. Thus we focus on the implication (ii =^>). 

Note that if et + cy E [1, L — 1] particles are present in even sites, then they block at least e? + ej, +1 odd sites, which 
must then be unoccupied. Indeed in the top row each of the et particles blocks the odd node at its right and in the 
bottom row each of the e\, particles blocks the odd node at its left. In one of the two rows, say the top one, there is 
at least one even unoccupied site and consider the even site at its right where a particle resides. This particle blocks 
also the odd site at its left. Hence o t + oj, < L — (e t + e\, + 1), which gives Lf ( s (cr) — L — (et + e& + Ot + of) >0. 


(b) The topmost row has Pp even sites and pL odd sites. Denote by e (respectively o) the number of particles 
present in even (respectively odd) sites in row r^-i- The energy wastage of cr on this row can be computed as 
Uk-i(ct) = Pp — e — o. Trivially, if <J\ rK1 — then e — pp and thus Uk-i(ct) — 0. Let us prove the opposite 

implication. Assume that U|, K 1 f e ,. K ,, i.e. e < pL. If e — 0, then Uk-i(ct) > 1, since o < pp. If instead 
e £ [1, ^p — 1], then each particle residing in an even site blocks the odd site at its left, therefore o < ^p- — e, which 
implies 


U K -iP) 


L + l 
2 


— e — o > 


L + l 



> 1. 


□ 

□ 


Reduction algorithm for open grids 

We now describe the redaction algorithm for open grids, which is a modification of the reduction algorithm for toric 
grids that builds a path a’ in X from a given initial configuration cr to either o or e. The reduction algorithm for open 
grids takes two inputs instead of one: The initial configuration cr and the target state which is either o or e. This 
is the first crucial difference with the corresponding algorithm for toric grid, where the target configuration was 
always o. In the following, we first assume that the target state is o and illustrate the procedure in this case. The 
necessary modifications when the target state is e are presented later. 

The initial configuration cr for the reduction algorithm must be such that there are no particles in the even sites 
of the first column cq, i.e. 

E <r( v ) = ( 77 ) 

veconv e 

This condition ensures that the algorithm has enough "room" to work properly. Note that condition d77b is different 
from condition d70b for the reduction algorithm for toric grids, which requires instead that the even sites of both the 
first two columns cq and c\ should be empty. 

The path co is the concatenation of L paths a/ 1! ,..., a/ L ). Path cod) goes from crj to o~j + -\, where we set cq = t] 
and define recursively configuration <7y+i from configuration o y for every j — 1,..., L as 

{ (Tj(v) if v e A\ (cj\JCj +1 ), 
o(v) if v e Cj, 

crj(v) if v e c/+i n v 0 , 

0 if v £ Cj -|-i n v e . 

This procedure guarantees that C7y + i = o. The path ojL : for j — 1,..., L is constructed exactly as the path to^P for the 
reduction algorithm for toric grids. Since their construction is identical, every path a enjoys the same properties 
as those of the original reduction algorithm, namely 

H(<r j+ i) < H(crj) and ^ < H(«r y ) + 1. 



36 





This means that the path co : cr —> o created by their concatenation satisfies 

®u, < H(cr) + 1. 

In the scenario where the target state is e, three modifications are needed. First the initial state cr must be such 
that there are no particles in the odd sites of the first column Cq, i.e. 

E a< /°) = °- 

veco nv e 

Secondly, the sequence of intermediate configurations oy j — 1,... ,L must be modified as follows: We set cq = <7 
and we define recursively &j+\ from <Xj as 

{ (Tj(v) if V e A\ (cyUCy+i), 
e(v) if vecj, 

o-j(v) if v e Cj +1 n v e , 

0 if v E Cj + \ n V 0 . 

Lastly, for step i of path co^\ we need a different offset to select the site v, namely v — (j, i + (j (mod 2))) when i = 0 
(mod 2) and v — ( j,i — 1 + (j (mod 2))) when i = 1 (mod 2). One can check that the resulting path co : cr —> e 
satisfies the inequality 

< H(cr) + 1. 

Proof of Theorem l5.21 al and (b). It is enough to prove that for every cr 6 X \ {e, o} 

<E>(c r , {e,o}) — H(cr) < [K/2\. 

Indeed, this claim, together with the equivalent characterization of T given in Lemma 13.61 proves simultaneously 
inequality lf73ll when KL = 0 (mod 2) and the strict inequality d74l) when KL = 1 (mod 2), since in this case 
L-K/2J < | K/2 \. To prove such an inequality, we have to exhibit for every cr e X \ {e, o} a path u> : <r ^ {e, o} in 
X such that <f> a , = max, ;gw H(rj) < H(a) + [K/ 2J. 

Let b be the number of particles present in configuration cr in the odd sites of the leftmost column of A, i.e. 

&:= E 

vec 0 nv o 

Every column in A has [K/2] odd sites, and hence 0 < b < [K/2], Differently from the proof of Theorem 15.1 l ab 
here the value of b determines how the path co will be constructed. We distinguish two cases: (a) b = |_L/2J and (b) 
b<[K/2\. 

(a) Assume that b — [K/2 J. In this case, we construct a path co : cr —y o by means of the reduction algorithm for 
open grids, choosing as initial configuration cr and as target configuration o. The way this path is built guarantees 
that Oo, < H (cr) + 1, which implies that 

<5(cr,o) — H(cr) = 1 < [K/2]. 

(b) Assume that b < |_L/2J. In this case we create a path co : j —» e as the concatenation of two shorter paths, 
on 1 ) and a/ 2 ), where co l 1 ) : cr — > u' , a’^ 2 ^ : cr' —> e and cr' is a suitable configuration which depends on cr (see 
definition below). The reason why co is best described as concatenation of two shorter paths is the following: Since 
b < [K/2], the reduction algorithm can not be started directly from a and the path a/ 1 ) indeed leads from cr to cr', 
which is a suitable configuration to initialize the reduction algorithm for open grids. The configuration cr' differs 
from cr only in the odd sites of the first column, that is 

s , = icr(v) if v e A\ (c 0 n V 0 ), 

a[v) -\o ifz>ec 0 nv o . 

The path a/ 1 ) — {co^\..., with co^ — a and a — cr', can be constructed as follows. For i — 1,..., b, at 

( 1 ) 

step i we remove from configuration co] the topmost particle in cq PI V 0 increasing the energy by 1 and obtaining 
in this way configuration cof E. Therefore the configuration cr' is such that H(cr') — H(cr) = b and 

$ a;(i) = ma * H (v) <H(cr) + b. 

rje oA 1 ) 
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The path o/ 2 ^ : j' —>■ e is then constructed by means of the reduction algorithm for open grids described earlier, 
using cr' as initial configuration and e as target configuration. The reduction algorithm guarantees that 

‘JV 2 ) = m ax H(rj) < H(tr') + 1. 

The concatenation of the two paths co ( 1J and u>' 2: gives a path to : a —» e which satisfies the inequality <fy, < 
H(c) + b + 1 and therefore 

<&((r,e) -H(cr) = b + 1 < [K/2\. □ 

Proposition 5.9 (Lower bound for <l>(e, o)). 

4>(e,o)-H(e) > \K/2] +1. 

Proof. It is enough to show that in every path co : e —» o there is at least one configuration with energy wastage 
greater than or equal to \K/2] + 1. Take a path co — (co \,.. ■ ,co n ) 6 n e , 0 - Since e does not have an odd bridge 
while o does, at some point along the path co there must be a configuration co m * which is the first to display an odd 
bridge, horizontal or vertical, or both simultaneously. In symbols 

m* min{;« < n \ 3i : (co m )| r . = 0 | r . or 3 j : (to m )\ Cj — 0 | e }. 

Clearly m* > 2. We claim that > [1^/2] + 1 or U{aj m *- 2 ) > [L/2] + 1. We distinguish the following 

three cases: 

(a) co m * displays an odd vertical bridge only; 

(b) co m * displays an odd horizontal bridge only; 

(c) co m * displays an odd cross. 

These three cases cover all possibilities, since the addition of a single particle cannot create more than one bridge in 
each direction. Let v* 6 Abe the unique site where configuration to m >-\ and co m * differ. 

For case (a), assume first that v* belong to the z'*-th horizontal stripe, i.e. v* £ S,* for some 0 < i* < [K/2\ — 1. 
By construction, v* must be an odd site and co m *_i(v*) — 0 and cv m *(v*) — 1 and thus Uf t (co m *-\) > 1. We claim 
that in fact 

> 2 . 

It is enough to show that f 1. Suppose by contradiction that — 1, then it must be the case 

that Uf,(cv m *) = 0 , due the addition of a particle in if , and by Lemma [5.81 the horizontal stripe S,- must agree fully 
with o (co m * f e, since it has a particle residing in v* which is an odd site). This fact would imply that co m * has an 
odd horizontal bridge, which contradicts our assumption for case (a). 

Assume instead that K is odd and that v* does not belong to any horizontal stripe and belongs instead to the 
topmost row, i.e. v* £ Yk- \ ■ By construction, v* must be an odd site and (v *) = 0 and co m * {v*) — 1 and thus 

^ 1- claim that in fact 

> 2. 

It is enough to show that lfK-i(oz m *_i) f 1. Suppose by contradiction that b /k —I ( a ’nr-l) — 1/ then it must be 
U K _ | (co m t ) — 0, due to the addition of a particle in v*. By Lemma [5. 81 a;,,,* must agree fully with e on this topmost 
row, but this cannot be the case since co m * has a particle residing in v* which is an odd site. 

Moreover, we claim that the energy wastage in every horizontal stripe that does not contain site v* (and in 
the topmost row if KL = 1 (mod 2) and v* f r«_|) is also greater than or equal to 1. Indeed, configuration 
oo m *_\ cannot display any horizontal odd bridge (by definition of i*) and neither a horizontal even bridge, since 
a’ m *_ 1 (iz* + (1,0)) = 0 and a;,„*_ 1 (i;* + (—1,0)) = 0. Therefore for every i — 1,..., [K/2\ such that v* ^ Sj we 
have (oj m *)\ Sj 0 | S .,e| S . and hence, by Lemma l5.8l 

Uf (u’ m *) > 1 . 
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If K is odd, then the topmost row Tk- \ cannot be a horizontal odd bridge (our assumption would be violated) and 
neither a horizontal even bridge (it would be impossible to obtain the horizontal odd bridge which a) m * has in a 
single step, the minimum number of steps needed is two). Therefore, by Lemma (5751 

Ur- i) > 1. 


There are three possible scenarios: 

• K even: There are K/2 — 1 horizontal stripes with positive energy wastage and > 2; 

• K odd and v* tk-Y- There are |_fC/2j — 2 horizontal stripes with positive energy wastage, lf/c_ i |) > 1 

and lf?*((U m *_i) > 2; 

• K odd and v* E Yk- \ '■ There are |_fC/2J — 1 horizontal stripes with positive energy wastage and LJ/<■_. ] ) > 

2. 

In all three scenarios, by summing the energy wastage of the horizontal stripes (and possibly that of the topmost 
row) we obtain 

U (> [K/2] + 1. 

For case (b) we can argue in a similar way, but interchanging the roles of rows and columns, and obtain that 

i) > [L/2] +1 > [K/2] + 1. 

For case (c), the vertical and horizontal odd bridges that co m * has must necessarily meet in the odd site v*. Having 
an odd cross, co m * cannot display any horizontal or vertical even bridge. Consider the previous configuration a ’ m *_\ 
along the path co, which can be obtained from co m * by removing the particle in v*. From these considerations and 
from the definition of m* it follows that i has no vertical bridge (neither odd or even) and thus, by Lemma 15.81 
it has energy wastage at least one in each of the [L/2J vertical stripes and possibly in the leftmost column, if L is 
odd. In both cases, we have 

U{w m .- 1) > [L/2], 

If there is at least one column in which <u m »_\ has energy wastage strictly greater than one, then the proof is con¬ 
cluded, since 

U(co m *_ i) > [L/2] +1 > [K/2] +1. 

Consider now the other scenario, in which the configuration co m -| has energy wastage exactly one in every vertical 
stripe (and possibly in the leftmost column, if L is odd), which means H((U m *_i) = [L/2]. Consider its predecessor 
in the path co, namely the configuration We claim that 

U(cV m *-2) - [1/2] +1. 

Indeed, by construction, configuration cu m *-2 must differ in exactly one site from and therefore 

2 ) l L((U m *_i) ± 1. 

Consider the case where U(cu m *- 2 ) — U(— 1— [L/2] — 1. In this case the configuration cu m *-2 must have 
a zero-energy-wastage vertical stripe and by Lemma 15.81 it would be a vertical double bridge. If it was a vertical 
odd double bridge, the definition of m* would be violated. If it was an even vertical double bridge, it would be 
impossible to obtain the horizontal odd bridge (which co m * has) in just two single-site updates, since three is the 
minimum number of single-site updates needed. Therefore 

U(co m *—2 ) — L+1 — [L/2] +1. □ 

□ 


Proposition 5.10 (Reference path). There exists a path co* : e —>■ o in Xc K , S!<c/i that 

^•-H(e) - [K/2] +1. 
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Proof. We describe just briefly how the reference path a;* is constructed, since it is very similar to the one given in 
the proof of Proposition [577] Also in this case, the path a;* is the concatenation of two shorter paths, ok 1 J and a;' 2 - 1 , 
where to ' 1 ; : e —> cr* and co' 2} : cr* o, where cr* is the configuration that differs from e only in the even sites of 
the leftmost column: 


e(z>) if v G A \ Co, 
0 if v £ cq. 


The path on 1 ) consists of \K/2] steps, at each of which we remove the first particle in cq f] V,, in lexicographic order 
from the previous configuration. The last configuration is precisely cr *, which has energy H(cr*) — H(e)+ | K/2], 
and, trivially, <E> uj = H(e) + [K/2] . The second path co^ : cr* —> o is then constructed by means of the reduction 
algorithm, which can be used since configuration cr* is a suitable initial configuration for it, satisfying condition (1771 . 
The algorithm guarantees that = H(cr*) + 1 and thus the concatenation of the two paths a/ 1 ) and to 121 yields 
a path co* with <f> w * = max );6a , H(rj) — H(e) + \K/2] + 1 as desired. □ □ 


5.3 Energy landscape analysis for cylindrical grids (Proof of Theorem 15.31 1 

In this subsection we briefly describe how to proceed to prove Theorem 15.31 The cylindrical grid Ck,l is a hybrid 
between the toric grid and the open grid, since the columns of Ck,l have the same structure as the columns of the 
toric grid T^i, while the horizontal stripes of enjoy the same structural properties of those of the open grid 
Gk,l- Along the lines of Lemmas 15.51 and 15.81 we can prove that the only columns with zero energy wastage are 
vertical bridges and the only horizontal stripes with zero energy wastage are horizontal double bridges. 

In order to prove that 

<h(e, o) - H(e) > min{K/2, L} + 1, 

one can argue in a similar way as was done for the other two types of grids. Also for the cylindrical grid, in any 
path co : e —> o there must be a configuration co m * which is the first to display a horizontal odd bridge or a vertical 
odd bridge or both simultaneously, i.e. 


m* min{m < n | 3i : (cu m )| r . = 0 | r . or 3; : (co m )\ Cj — Op }. 


One can prove that 


max{U(co m *-i),U(co m *- 2 )} > min{K/2,L} + 1. 


We distinguish two cases, depending on whether K/2 > L or K/2 < L. In these two cases, the proof can be obtained 
by studying the energy wastage either in the columns or in the horizontal stripes, in the same spirit as for the toric 
and open grids in Subsections l5.ll and l5.21 respectively. Moreover, depending on whether K/2 > L or K/2 < L, we 
can take the reference path co* to be the same as in Subsections 15. Il and l5.21 respectively. Lastly, one can show that 


T(X \ (e, o}) < min{K/2, L}, 


by exploiting what has been done in Subsection 15.11 if K/2 > L, and the strategy adopted in Subsection 15.21 other¬ 
wise. 


6 Conclusions 

We have studied the first hitting times between maximum-occupancy configurations and mixing times for the hard¬ 
core interaction of particles on grid graphs. In order to do so, we extended the framework (27l for reversible 
Metropolis Markov chains. We expect that similar results for the first hitting time rf with a general initial state x 
and target subset A can be proved for irreversible Markov chains that satisfy the Friedlin-Wentzell condition dT4t . 
Furthermore, we developed a novel combinatorial method for grid graphs, valid for various boundary conditions, 
which shows that the energy landscape corresponding to hard-core dynamics on grid graphs has no deep wells and 
yields the minimum energy barrier between the two chessboard configurations e and o. We obtained in this way 
results for the asymptotic behavior of the first hitting time r® in the low-temperature regime. We expect that our 
combinatorial approach can be exploited to prove similar results for other graphs which can be embedded in a grid 
graph (e.g. triangular or hexagonal lattice) or for the hard-core model where there are two or more types of particles 
and the hard-core constraints exist only between particles of different type. The study of the critical configurations 
and of the minimal gates along the transition from e to o was beyond the scope of this paper and will be the focus 
of future work. 
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